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UNIVERSALITY FOR THE LARGEST EIGENVALUE OF A 
CLASS OF SAMPLE COVARIANCE MATRICES 

ZHIGANG BAG, GUANGMING PAN, AND WANG ZHOU 



Abstract. In this paper, we will derive the universality of the largest eigenvalue 

of a class of real or complex large dimensional sample covariance matrices in the 

form of Wjv = E^/^XJf*E^/^. Here X = {xij)M,N is an M x N random ma- 

t*^ ' trix with independent entries Xij, I < i < M, 1 < j < N such that Kxij = 0, 

^l^ijP ~ 1/Af. We say Wat is a classical complex sample covariance matrix if 
there also exists Exfj = 0,1 < i < M, 1 < j < N. Moreover, on dimensions we 



assume that M = M{N) and N/M ^> d £ (0, oo) as TV -> oo. For a class of de- 
terministic positive definite M x M matrix S, we show that the limiting behavior 
of the largest eigenvalue of Wn is universal under some additional assumptions 
on the distributions of (xijYs by pursuing a Green function comparison strategy 
Q^ ■ raised in [211 122) by Erdos, Yau and Yin for Wigner matrices and extended by 

Q, ' Pillai and Yin [30] to sample covariance matrices in the null case (E = J). Gonse- 

quently, in the classical complex case, combing this universality property and the 
results known for Gaussian case derived by El Karoui in 10 (nonsingular case) 
C^ I and Onatski in [28] (singular case) we show that after appropriate normalization 

the largest eigenvalue of Wn converges weakly to the type 2 Tracy- Widom distri- 
bution TW2. Moreover, it is straightforward to extend our argument to establish 
the universality for the limiting joint distribution of the largest k eigenvalues of 
^^ . Wn with any fixed positive integer k. 

o 

0\ ' 1. Introduction 

•y^ ■ In recent decades, researchers working on multivariate analysis have a growing in- 

■r^lj- I terest in large dimensional data arising from various fields such as genomics, image 

^^ ■ processing, microarray, proteomics and finance, to name but a few. Consequently, 

the spectral analysis of large dimensional sample covariance matrices has attracted 
considerable interest among mathematicians, probabilitists and statisticians. The 
study towards the eigenvalues of sample covariance matrices traces back to the work 
^ ! of Hsu [23j , and became flourishing after the seminal work of Marcenko and Pastur 

[27] . In [27], the authors raised the limiting spectral distribution (MP type distribu- 
tion) for a class of sample covariance matrices. After that, a lot of researchers took 
part in developing the asymptotic theory of the empirical spectral distribution of 
large dimensional sample covariance matrices, one can refer to [5l[9l[38] for instance. 
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In this paper, we will focus on the limiting behavior of the largest eigenvalues of 
a class of large dimensional sample covariance matrices. Precisely, we will consider 
the sample covariance matrix in the form of 

Wn = T}'^XX*T}'\ X = (x,,)M,iv, (1.1) 

where {xij := Xij{N), 1 < i < M := M{N), 1 < j < N} is a collection of indepen- 
dent real or complex variables such that 

Exjj = 0, E|xij| = — . 

Moreover, we say Wn is a classical complex sample covariance matrix if there also 
exists 

Exjj = 0, l<i<M,l<j<N. 

Here S := S^r is an M x M positive definite matrix. Thus if the columns of X 
are i.i.d, Wn can be viewed as the sample covariance matrix of A^ samples of the 
random vector S^' ^x, where x is a random vector which has the same distribution 
as each column of y/NX. We call Wat is a Wishart matrix if (xjj)'s are Gaussian. 

As is well known, the limiting distributions of the largest eigenvalues for classical 
large dimensional random matrices were originally raised by Tracy and Widom in 
[MIEZ] for Gaussian Wigner ensemble G(0/U/S)E, thus are named as Tracy- Widom 
law of type /3 (/3=1,2,4 for GOE,GUE,GSE respectively) which wih be denoted 
by TW^ hereafter. Later on, in [25j Johansson proved that when Wat in (jl.ip is 
a classical complex Wishart matrix in the null case (S = Im)-, after appropriate 
normalization the largest eigenvalue of Wat converges to TW2 weakly. After that, 
Johnstone solved the real Wishart case in [26j and show that when S = Im, the 
largest eigenvalue of Wn converges to TWi weakly after appropriate normalization. 

For the non-null case (S 7^ Im)-, rnost works have been devoted to deal with 
the so called spiked model. We say Wat is spiked when there are finite number of 
eigenvalues of S not equal to 1. On spiked Wishart model, one can refer to [6j and 
[8] for classical complex and real case respectively. Under more general setting of S, 
the most relevant result was given by El Karoui in [10] for classical complex non-null 
Wishart matrices with a general assumption on S when d > 1 (nonsingular case), 
and extended by Onatski to the singular case (0 < d < 1) in [28] later . 

Note that all the results mentioned above are particularly for the Gaussian matri- 
ces. A central issue in Random Matrix Theory is the so called universality property 
for spectrum statistics. That means, the limiting behavior of an eigenvalue statis- 
tic usually does not depend on the details of the distribution of the matrix entries 
thus should hold under more general distribution assumption (but not particularly 
for Gaussian variables). The universality problem of the extreme eigenvalues are 
usually refer to as the edge universality problem, in contrast to the so called bulk 
universality problem which concerns the universal properties of the eigenvalues lying 
in the bulk of the spectrum. Specifically, for sample covariance matrices, in the null 
case, Tracy- Widom law for the extreme eigenvalues of Wishart matrices has been ex- 
tended to more general distributed Wn-, one can see [331 123 E21 EH] for instance. For 



generally distributed spiked sample covariance matrices, the universality property 
was also partially solved in [3| and |24j . 

In this paper, we will prove the universality of the largest eigenvalues of a class 
of non-null Wn under a general setting on S which is analogous to those in [TU] and 
|28j . A direct consequence of such a universality property and the results in [10] and 
|28j is that the TW2 law also holds for generally distributed classical complex Wn 
under our setting on E. This work can be viewed as a generalization of the edge 
universality in the null case (S = /) provided by Pillai and Yin in [30] to a class of 
non-null sample covariance matrices. In the sequel, we will present our main results 
and introduce some notations at first. And after that, we will give an introduction 
of the proof route which relies on a Green function comparison strategy raised by 
Erdos, Yau and Yin in J2H [22] for Wigner matrices originally and extended by Pillai 
and Yin [30] to sample covariance matrices in the null case. 

To state our results, we introduce some notations at first. In the sequel, we will 
denote the ordered eigenvalues of an n x n Hermitian matrix A by 

Xn{A)<...<\2{A)<Xi{A). (1.2) 

For ease of presentation, we set 

N 
dAT := — —)• d G (0, 00), as N ^ 00. 

We denote the empirical spectral distribution (ESD) of S by 

1 *^ 



i=l 



and that of Wn by 



1 ^ 

i=l 

Here 1§ represents the indicator function of the event S. 

To state our main result, we need to introduce a crucial parameter c =: c(Il, A^, M). 
Let 

c:=c(S,A,M), cG [0,l/Ai(E)), 

such that 



Ac 



1-Ac 



dHN{X) = (In- (1.3) 



It is easy to check that the definition of c is unique. Now we can state our main 
condition as follows. 

Condition 1.1. Throughout the paper, we will need the following conditions. 

(i)(On dimensions): We assume that there are some positive constants ci and Ci 
such that 

ci < d]s[ < Ci. 



4 ZHIGANG BAD, GUANGMING PAN, AND WANG ZHOU 

(a) (On X): We assume that {xij := Xij{N),l < i < M,l < j < N} is a 
collection of independent real or complex variables such that 

1 

N' 



Kxij = 0, IE|a;jj| 



Moreover, we assume that \/Nxij's have a sub- exponential tail, i.e. there exists 
some positive constant tq independent ofi,j,N such that for sufficiently large t, one 
has 

H\VNxij\ >t)< To-^exp(-r°). (1.4) 

(Hi) (On Ti): We assume that 



liminf Am(5^) > 0, limsupAi(Il) < oo 

N ^ 



and 



limsupAi(S)c < 1. (1.5) 

TV 

Moreover, we assume that there is a definite probability distribution function H{X) 
such that 

Hn{X) =^ H{X), N ^oo. 

And we will also need the following terminology. 

Definition 1.2 (Matching to order k.). Let X" = {xf-)M,N and X^ = {xJj)m,n be 
two matrices satisfying (ii) of Condition \1.1\ We say X^ matches X^ to order k if 
for all 1 < i < M, 1 < j < N and nonnegative integers l,m with I + m < k, there 
exists 



¥.{^{^/Nxl)^%^/NxYjD 
= E($R(\/]Vx7^.)'9(^^<ir) + 0(e-('°s^)'') (1.6) 

with some positive constant C > 1. Moreover, if \1.0) holds, we also say that W" 
matches W^ to order k, where 

Under Condition I l.H we can state our main results. 

Theorem 1.3 (Universality for both real and complex cases). Let W" = i;^/^X"(X")*S^/^ 
and W^ = T}''^ X^ {X^)*T}i'^ be two sample covariance matrices satisfying Condi- 
tion \l.l[ where X^ := {xfj)M N and X^ := (xJAm N- Let 

Then for any real number s which may depend on N , there exist some positive 
constants e, 5 > such that 

P(Af2/3(Ai(W") -\r)<s- N-') - N-^ 

< ¥{N'^f^{\i{W^)-X,)<s) 

< F{N^/^{Xi{W)-Xr)<s + N-') + N-^ (1.8) 



A, = -(i + d^i/- -dHNiX)]. (1.7) 



if one of the following two additional conditions holds, 
A; S is diagonal and W" matches W^ to order 2. 

B; W" matches W"^ to order 4. 

Remark 1.4. Actually, we can analogously derive the universality property for the 
joint distribution of the largest k eigenvalues of Wn for any fixed k. That means, 
under Condition \l.l\ and the additional assumption A or B, the following extension 

ofUM 

¥{N'^/'\\i{W) - K) < si - N-\ ..., N^/^Xk{W) - Xr) < Sfc - N-') - N'^ 

< F{N^/'\Xi{W^) -Xr)<si,..., iV2/3(Afe(W") - A,) < Sk) 

< ¥{N^/'\Xi{W) - Xr) < s + N-' , . . . , N^/^{XkiW) - Xr) <Sk + N-')+N^^ 
holds for any real numbers si, . . . ,Sk when N is sufficiently large. 

Combining Theorem 11.31 with Theorem 1 of [lOj and Proposition 2 of [23 we see 
that in the classical complex case (Ex?- = 0), the following more concrete result 
holds. 

Theorem 1.5 (Tracy- Widom limit for classical complex case). Let Wfj be a classical 
complex Wishart matrix and Wn be a general classical complex sample covariance 
matrix and both of them satisfy Condition \1.1[ We have 

if either S is diagonal or Wn matches Wfj to order 4. Here 

Remark 1.6. Correspondingly, we also have that the joint distribution of 

'Xi{WN)-Xr Afc(Wjv)-A, 



TW2 
a 



a a 

converges weakly to the k- dimensional joint TW2 

Below we will give a brief explanation on the choice of the parameter c and 
meanwhile some crucial concepts will also be introduced. With these concepts we 
will present the route of our proof later. Define the N x N matrix 

Wn := X*^X 

which shares the same non-zero eigenvalues with Wat. 

Denote the ESD of Wn by Fn , then it is elementary to see 

FnW = rf^'Zjv(A) + (1 - d^')l{A>0}- 
When there is some definite distribution H such that 

Hn^H (1.9) 
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as A^ — )• oo, it is well known that there are definite distributions F^^h and E.d H such 
that 

Fn =^ Fd^H, Zat =^ Fd,H 
in probability. And we have the relation 

Fd,H = d-^Fd^H + (1 - c^"')l[o,oo)- (1-10) 

For any distribution function D, its Stieltjes transform m£){z) is defined by 

mD{z)= f j^dDiX) 

for all z £ C^ := {oj G C, Qw > 0}. And for any square matrix A, its Green function 
is defined by 

Ga{z) = {A-zI)-\ zgC+. 

For simplicity, we denote 

m{z) := mp^^^^iz), rn{z) := mp;^^(z), rriNiz) := mpj^iz), m^{z) := mF^{z). 

For general S, F^^h usually has no closed form expression. Fortunately, the 
Stieltjes transform of F^^h admits a self-consistent equation. Actually m{z) is the 
unique solution in C"*" of the self-consistence equation 

Tfli Z) ^ 

for z G C"*". From (jl.lOp we have 

dr^ -I 

m{z) = h d m(z). 

However, what we need in the sequel is the non-asymptotic version of F^ih which 
can be obtained through replacing d and H by d]\f and Hi\f in Fg^fj; and thus will 
be denoted by F^j^^Hn- More precisely, F^j^^Hn i^ the corresponding distribution 
function of the Stieltjes transform mdf^,Hf^{z) := mp^ „ {z) G C^ satisfying the 
following self-consistent equation 

nid^,HM = ^ 1 . f -, z G C+. (1.11) 

In other words, by the inverse formula of Stieltjes transform, we can define 

0, if X < 0, 



FdM,HMi^) '■'- 



l|div>l}(l-C^Ar^), if X = 0, 



Correspondingly, we can define the non asymptotic versions of F_d u and rn{z) de- 
noted by Zdjv.-ffiv ^^^ inidi^,Hi^{^) respectively. For ease of notation, we will briefly 



denote 



mo{z) := md^,Hr,{z), mo{z) := mdjv,Hjv(^)' ^o := F^^^Hm^ Zo := Zdjv,Hjv 



in the sequel. By definition, we have 

d~^ - 1 

moiz) = — h d^^niQiz). 

Moreover, the relation above still holds if we replace mo{z) and rn^iz) by m]\f{z) 
and m^(z) respectively. 

By discussions in |32] we can learn that Fq has a continuous derivative po on 
M — {0}. Actually, by Lemma 6.2 of [3j, it is not difficult to see the rightmost 
boundary of the support of pQ is A,, defined in ()1.7p . i.e. 



Moreover, there exists 



Xr =inf{x G R : Fo{x) = 1}. 



lim moiz). 



The existence of lim^gc+_i.^ rno{z) for x G M \ {0} has been proved in [321|. 

Our main result Theorem 11.31 can be viewed as a generalization of the edge univer- 
sality for sample covariance matrices in the null case provided in |3U] by Pillai and 
Yin to a large class of non null cases. In [30], the authors adopt a Green function 
comparison strategy which was raised in [2T] and (22j for Wigner matrices origi- 
nally. Such a strategy will also be adopted in our non-null case. The Green function 
comparison strategy relies on a strong law of local eigenvalue density which asserts 
that the limiting spectral law is even valid on short intervals which contain only 
A^*" eigenvalues for any constant e > 0. Such a limiting law on microscopic scales 
was developed in a serious works |151 [T6l [T71 [22] for Wigner matrices originally and 
was shown to be crucial in recent works on universality problems of local eigen- 
value statistics, one can refer to [181 ED El] and the survey paper [Tlj for instance. 
Moreover, such type of results are not only inputs of the Green function comparison 
strategy but also of interest in their own right. The main route in |2H [22| [30] can 
be summarized into the following two steps. 

1: Deriving a strong law of local eigenvalue density. 

2: Establishing a Green function comparison theorem. 

Roughly speaking, step 2 can lead one to the final universality result and step 1 
is the main technical input to fulfill step 2 in this two steps strategy. In the sequel, 
we will sketch the main ideas of the Green function comparison strategy and give 
a short review on the local eigenvalue density problems developed in the previous 
works at first. And then, we will sketch the main difficulties and the corresponding 
methods to conquer them in the process of pursuing such a strategy for our model. 

• : Green function comparison strategy 

The main idea of the Green function comparison strategy is that when we want 
to show the universality of a statistic of some random matrix ensemble, at first we 
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need to represent the target statistic in terms of some functional of the Green func- 
tion or sometime just the Stieltjes transform of the random matrix. Then we can 
use the classical Lindeberg swapping strategy to compare the distributions of the 
functional of the Green functions of two matrices. Owing to the simplicity of the 
Green function expansion formula, such a comparison strategy is usually effective. 
This strategy was raised in [201 l2H [22] for deriving both bulk and edge universality 
of a class of generalized Wigner matrices, and used in [14J for the local spectral sta- 
tistics of Erdos-Renyi graphs. Recently, Pillai and Yin adopted the Green function 
comparison strategy to obtain both bulk and edge universality of sample covariance 
matrices in the null case in [30]. Instead of replacing entries one by one in the 
Lindeberg swapping process in |20l [211 IISI [H] , the authors in [30] use a strategy to 
replace once a column of the sample covariance matrices in the swapping process. 
Such an idea will also be pursued in our non-null case. 

• : Local eigenvalue density problem 

In order to use the Lindeberg swapping strategy on the functionals of Green 
function, some estimations on the entries of the Green function and the Stieltjes 
transform are necessary as the main technical input. Precisely, the strong law of 
local eigenvalue density needs to be proved at first as mentioned above. The local 
eigenvalue density problem was raised by Erdos, Schlein and Yau in [I5j for Wigner 
matrices, in which the authors showed that the semicircle law holds in bulk even on 
microscopic scale of A^^^'^logA^ when the distributions of the entries of a Wigner 
matrix satisfy some regular properties. In terms of the Stieltjes transform, this re- 
sult can be described as that the Stieltjes transform of the ESD of Wigner matrix 
concentrates on that of the semicircle law for any z = E + i-q when E is in the bulk of 
the semicircle law and r] > iV~^' ^ log A^. The assumption on the distributions of the 
matrix entries and z (especially the lower bound of rj) were improved in a series of 
subsequent works [161 [T71 [22] . Especially, in [22j , the local seinicircle law was shown 
to be valid for t] > N^^{logN)^^^°^^"^^' and E is allowed to locate in an interval 
including the whole support of the semicircle law therein. More specifically, in [22], 
the difference between the the Stieltjes transform of ESD and that of the semicircle 
law can be bounded by {Nr])~'^(log A^)*^('°g^°g-^) in magnitude with high probability. 
Such a result is usually referred to as the strong local semicircle law for generalized 
Wigner matrices. Analogously, the local MP law for the sample covariance matrices 
in the null case were proved in [351 I3H1 [13 [3DJ ^^ varying degrees. Especially, in [30j . 
the authors derived a strong local MP law. Very recently, Erdos and Farrell studied 
the local eigenvalue density of generalized MANOVA matrices in the bulk case in 
[12] . As a by-product, the authors in ^2j also provided a MP type law for the local 
eigenvalue density of matrix T^''^XX*T^''^ in the bulk case, where T is specified to 
be the inverse of another sample covariance matrix which is independent of XX* . 
Obviously, the matrix T^/'^XX*T^''^ in [T2] can also be regarded as a sample covari- 
ance matrix with the special random population T. In this sense, Ll2j shed light on 
establishing the local MP type law for Wn under our assumption. 



• Proof route and novelties of our paper 

Roughly speaking, we will also adopt the Green function strategy to prove our 
main result Theorem 1 1.3[ However, since we only care about the largest eigenvalue, 
we will restrict the discussions on the rightmost edge of the ESD. In other words, 
our main task is to provide the following two: 

1': A strong local MP-type law around A,-. 

2': A Green function comparison theorem on the rightmost edge of the ESD. 

However, the generality of S in our setting produces a lot of obstacles in pursuing 
such a two-steps strategy. 

At first, in the null case, the limiting spectral distribution is well known as 
Marcenko-Pastur law (MP law) which has a closed form formula. As a consequence, 
the properties of Stieltjes transform of MP law can be easily obtained. Actually, 
these basic properties are crucial inputs for establishing the edge universality in the 
null case. One can refer to Lemma 6.5 of f30] for instance. However, in the non-null 
case, to get the corresponding properties around Aj. is not a trivial thing. Actually, 
analysis towards the behaviors of po and rnoi^z) will be our first main task. We 
will show that po{x) admits a square root behavior in an interval [Xr — c. A,.] for 
some small positive constant c. Such a square root behavior is guaranteed by (iii) 
of Condition 11.11 and will be the basic input to establish the properties of mQ{z) in 
Lemma 12.31 

After that, we can start to establish 1' and 2' . It will be found that the roadmap 
of the proof in the null case in [30j is also applicable even under our general setting. 

In the first step 1', we will follow the bootstrap strategy developed in [20 l [22 l [T3 | 
[50] to establish the strong MP-type law around A^. The word "bootstrap" means 
that one can provide a weak law of local eigenvalue density at first (in our case 
see Theorem 13. 3|) . then the weak law can help to obtain the desired strong law 
(see Theorem 13. 2|) through a bootstrap process. One main technical tool to derive 
the strong law from the weaker one is an abstract decoupling lemma from [30j (see 
Lemma 7.3 therein) which can help to bound the summation of a class of weakly 
dependent random variables. Such a decoupling lemma is similar to Theorem 5.6 
of [13] and Lemma 4.1 of [22] but is more general and applicable for our model. 
However, most parts of the proof require more general treatments. In the proof of 
1', one main difficulty is that the self-consistent equations for the Stieltjes transforms 
{mi\i and mo) are more complicated than the null case. Such a complexity makes 
the proof of the strong local MP type law much more cumbersome. For example, in 
[30] . once the closeness of the self-consistent equations of m^ and tuq is obtained, 
the difference between mjsf and niQ themselves can be characterized easily with the 
aid of the closed form of tuq. However, in the non-null case, this step is much more 
indirect. To overcome this difficulty, we will rely on a discussion on the stability 
of the self-consistent equation of the Stietjes transform for T^/^XX*T^'^ in |T^ . 
Though the argument in [T^] was only provided for the bulk case, we find it can be 
generalized to the edge case under our assumptions on Wn- 
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In the proof of 2', we will see that a main step for the Green function comparison 
strategy is to bound the entries of the matrix S^'^(W — z)~'^T}-''^ when z = E + ir] 
with IE' — A^l < N~^''^~^'' and tj = N~'^'^~^ for some small positive constant e. When 
S is diagonal, obviously we can bound the entries of the Green function (W — z)~^ 
instead. To this end, by using spectral decomposition and the strong local MP type 
law, we will see that it suffices to provide the so called delocalization property for 
the unit eigenvectors corresponding to the eigenvalues around A^. The delocalization 
property for eigenvectors of random matrices was also raised in [15] and improved in 
the series works |16l [T71 \T2\ for Wigner matrices and extended to sample covariance 
matrices in |35l [BUI [TUl [50] , which asserts that the coefficients of the unit eigenvectors 
are bounded by N~^''^{logN)^^^"^^°^^' uniformly with high probability. Hence, 
when S is diagonal, we will provide a delocalization property for the unit eigenvectors 
of Wn in the rightmost edge case. Then with the aid of the delocalization property 
we can bound the entries of the Green function of W in magnitude. However, for 
more general S, the issue becomes much more complicated. In general case, we 
turn to derive a rough bound on the imaginary parts of the diagonal elements of 
the matrix ^^/^(W - z)-^S^/2 ^^^^^ ^ ^ ^ j^ ^^ ^j^j^ |^ _ ;^^| < j\r-2/3+e ^^^^^ 

rj = N~'^'^~^ . It will be seen that such a rough bound still allows us to establish 
the Green function comparison theorem only with the assumption of 4 matching 
moments. Our main idea is to provide a rough bound for the diagonal entries of 
Si/2(VV; _ zo)"^S^/^ with zo = E + ir]o , where rjo = Ar-2/3+£ ^ ^. Then by 
this rough bound we can obtain a rough bound on (S^/^UjU*S^/^)fcfc, k = 1, . . . , M, 
where Uj is the unit eigenvector of W corresponding to Aj which is around A^. Then 
the bounds for (S^'^UjuTS^'^)^^ can turn to help us to provide a rough bound on 
the imaginary parts of the diagonal elements of the matrix S-^'-^(W — z)~^Ti^'^. 

Throughout the paper, we will use the notation O(-) and o(-) in the conventional 
sense. And we will use C, Co, Ci,C2, C3 to denote some positive constants whose 
values may be different from line to line. We say 

if there exist some positive constants Ci and C2 such that 

Ci\y\ < \x\ < C2\y\. 
We say two functions f{z),g{z) : C — t- C have the relation 

f{z) ~ g{z) 

if there exist some positive constants Ci and C2 independent of z such that 

Ci\giz)\<\f{z)\<C2\9iz)\. 
Define the parameter 

(/p:=(^^ = (logiVy°gl°s^ 

which will be used throughout the paper. We say an event S holds with C-high 
probability if there is some positive constant C such that 

P(§) > l-7V^exp(-(/?'^) 
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when N is large enough. Moreover, we will use Spec{A) to denote spectrum of a 
matrix A. And we will denote the operator norm and Hilbert-Schmidt norm of a 
matrix A by ||^||op and H^Hi^^g respectively. 

This paper is organized as follows. In Section 2, we will study the properties of 
mo(z) and po which will be crucial for our further analysis. In Section 3, we will 
prove that rriN^z) is close to mQ^z) when Kz is around Ar,i-e. the strong local MP 
type law holds around A^. In Section 4, we will use the strong local MP-type law 
and a Green function comparison approach to prove Theorem 11.31 and II. 5i 



2. Properties of pq and mo(z) 

In this section, we will prove some crucial properties of pQ and mo{z). At first, 
note that by the definitions of c, A^ and (iii) of Condition II. H it is easy to see 

c, \r ~ 1. 

Below we will provide some crude lower and upper bounds on Ai(W). Note that 
we have the following elementary inequality 

AAf(S)Ai(XX*) < Ai(W) < Ai(S)Ai(XX*). 

Now by the rigidity of the locations of the eigenvalues of XX* provided in Theorem 
3.3 of ^30j and Condition II . II we know for any fixed positive constant Q there exists 

Aa./(S)(1 + \/d)V5 < Ai(W) < 5Ai(S)(l + ^/df (2.1) 

with (^-high probability. Now we set 

Ci = Am(S)(1 + ^df/Co, Cr = CoAi(S)(l + Vdf. 

with sufficiently large Cq such that A^ G [2C;,Cr./2]. Then by (|2.1|) . we can choose 
Co sufficiently large such that 

Ci < Ai(W) < Cr (2.2) 

with (^-high probability. In the sequel, we will always write 

z := E + IT], 

and define the parameter 

K, := k{z) = \E — Xr\. 

Moreover, for C ^ 0, we define two sets, 

S{() ■.= {zeC:Ci<E<Cr, f'^N-^ < ?? < 1}, 

and 

Sr{c, C)--={z€C:Xr-C<E< Cr, if^ N'^ < ?? < 1}, 

where c is a sufficiently small positive constant whose value will be specified later. We 
will prove the following two lemmas. The first one claims the square root behavior 
of po(x) on interval [A^ — 2c, A,.] with some small positive constant c. 
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Lemma 2.1. Under Condition \1.1\. there exists some sufficiently small constant 
c > independent of N such that 



Po{x) 



y \r — X, for all x G [A^ — 2c, A,.]. (2-3) 



Remark 2.2. Note that in [27J and [32] the square root behavior of pq{x) near the 
boundary of its support has been discussed. However, the results in [27] and [32] 
does not imply i2. 3\) since here mo(-z) and po{x) are N-dependent. For general S, it 
is possible for the square root behavior only holds in an interval ( with A^ being its 
right end) with length of o(l). For example, in the 1 spike case S = (100, 1 . . . , 1), 
\2.3\) does not hold actually. 

The second one collects some crucial properties of tuq^z). 

Lemma 2.3. Under Condition \1.1[ for some sufficiently small positive constant c 
satisfying Ii2.3\) . the following four statements hold. 
(i): For z G S{Q), we have 



|mo(z)| ~ 1, 
(a): For z G S'r.(c, 0), we have 



if E > \r + r] 



9mo(z) ■ 

yjK + rj, if E e [\r - C,Xr + Tj) 

(Hi): For z G ^(O), we have 

Nr] N rj 

for some positive constant C. 
(iv): For z G S'r-(c, 0), we have 

|1 + tm^{z)\ > c(l + Ai(S)mo(A,)) > cq, Vi G [Am(S), Ai(E)] 

for some small positive constants c,cq. Moreover, there exists a sufficiently small 
constant fj := r/(co), when z G 5r(c, 0) with rj < fj, we also have 

l + t3f?mo(z) >c(l + Ai(S)mo(A^)) >co, Vt G [Am(S), Ai(S)]. (2.4) 

Remark 2.4. Note the second inequality in (Hi) of Lemma \2.'JM mplies that QmQ{z)/r] 
is decreasing in ry. 

At first we come to prove Lemma 12. H the proof of Lemma 12.31 will heavily rely 
on Lemma l2. 11 



Proof of Lemma \2.1[ For ease of presentation, we denote 

mi{z) = ^mo{z), 771-2(2:) = 9mo(z). 
Now let 

mo{x) := lim mo{z), mi{x) = ^mo{x), m2{x) = QmQ{x). 
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By Theorem 1.1 of [32j, we know mo(x) exists for all x G M \ {0}. Moreover, mo(x) 
is continuous on M\{0}. By definition and the inverse formula of Stieltjes transform 
we have 

po{x) = -m2ix), xeM\{0}, 

IT 

thus it suffices to prove the square root behavior of m2{x) when x is to the left of 
Ar- Note that by the fact m2{\r) = we have 

fn2{x) = { — 771,2771-2 (t)(it I 

Thus it suffices to show for some sufficiently small constant c > there exists some 
positive constants C > C independent of t such that 

-C < 77727772 (t) < -C", for ah t £ [Xr - 2c, A^). (2.5) 

To verify (|2.5p . we start from (jl.lip . which can be rewritten as 

z = -m-,\z) + djl I —±^dHN{t). (2.6) 

Note that (|2.6|) also holds for x G M \ {0}. When Kz > C2 for any fixed positive 
number C2, it is easy to see from (|2.6p that 

777o(z) ~ 1, C2 < Kz < a, (2.7) 

which also implies 

Po{x) = —7772(3;) < 0(1), C2<X< Xr. 

vr 
Set z = X in (|2.6p and write down the real and imaginary parts of both two sides 
we can get 



mi ^ ^_i f t{l + tmi)dHN{t) 

ml + 1712 ^ J (1 + i777i)2 + ^^7772 



= "^2 ( ^— -2 - rf^W 7TX7:r:\^T7^ 1 (2-8) 



^m\+m\ ^ J {I + tniiY + t'^'m\ 
When 7772 (x) 7^ 0, i.e. 7772 (x) > 0, (|2.8p implies that 

^_i [ tdHN{t) 



^ J {l + tmiY+t'^ml 

- 1 ,-1 / t^dH^jt) 

ml + ml ^ y (l + i777i)2 + t2^2' y^--^) 

For simplicity, above we have omitted the variable x from the notations 7771 (x) and 
7772 (x). We remind here by continuity and (|2.7p . ()2.9p still holds when A > C2 is a 
boundary of the support of 7772 (x). Moreover, when A is a boundary of the support 
of 7772 (x) and A > C2 with any fixed positive number C2, ()2.9p can be simplified to be 

1 i-i / ''dH^^') (210) 

"-777KA) ^^y {i+tm,{x)Y ^'-'"^ 
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since ?n.2(A) = in this case. Our analysis below will rely on (|2.9|) . thus at first we 
need to guarantee the validity of the following lemma. 

Lemma 2.5. Under Condition \1.1\ there exists some positive constant c such that 

m2{x) = iTpo{x) > 

in [Xr — 3c, Xr). 

At first, we proceed to prove Lemma [2.11 assuming the validity of Lemma [2. 51 and 
prove Lemma 12.51 after that. Using (|2.8p and Lemma 12.51 we have (|2.9p . Now taking 
derivatives with respect to x implicitly on both sides of two equations in ()2.9p . we 
can get 

J _ mi{A2 + miAs) - mlA ^ 
{A2 + mi^3)2 + valAl 



where 



[A2 + miA^Y + rr^ ^^ ^ ' 

A, 



h - ^% ] ((1 ^ ^.2 + ^2^2)2 ' •? - ^' •^• 



((l+tmi)2+t2^2)2' 

Note that obviously Aj > 0,j = 2,3. Moreover, if 

min(l + Aj(S)mi(x)) > c', for all x e [Xr - 2c, A^) (2.12) 

i 

for some positive constant c', we also have in [A^ — 2c, A,-), 

< mlA3,mlAl < Ci (2.13) 

and 

C2 < A2 + miAs < C'2 (2.14) 

for some positive constants Ci and C2 < C2. If we can take a step further to show 

-2ci <mi(x) < -ci (2.15) 

for some positive constant ci, then ()2.5p immediately follows from (|2.1ip . ()2.13p . ()2.14p 
and ()2.15p . Therefore, it remains to show there exists some small positive constant 
c such that (|2J2]l and (f2J5]l holds for all x e [Xr - 2c, A^). 
Now note that 

"ii(Ar) = wio(Aj.), m2(Aj.) = 0. 

At first , by the fact that mi{Xr) = — c and (|1.3p we see mi (A,.) < 0. Setting z = Xr 
in (|2.6p we can easily see that 

- Jci < mi{Xr) < -^ci (2.16) 

for some positive constant ci. And by (iii) of Condition 11.11 and the fact that 
"^i(Ar) = — c we also have 

min(l + Ai(i;)mi(A,.)) = 1 + Ai(S)mi(A^) > 2c' (2.17) 

i 

for some sufficiently small positive constant c'. Now we start from (|2.16p and (J2.17P 
to prove H^mh and H^ZTTSh by continuity. To show (IXT^ and ([XT^ hold for all 
x G [Xr — 2c, Xr] with some sufficiently small positive constant c , we need to control 
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the |m-'(x)|. Differentiating implicitly the first equation in (|2.9p with respect to x 
again and use (|2.11|) . we can get 



I m'l (x) 



1 + A 



mi{A2+m-iA3)—m'^A3 
' (A2+A3mi)2+A|m| 



A2 + miAs 

< Cmin|l + Aj(S)mi(x)r^ (2.18) 

i 

which can be easily checked by the definition of Aj,j = 2, 3. Now we set 



Ao := Ao(c) 

= infix G [A,.-3c,A^) : , "^, iL ./I > 1/2 and mi (t) G [-2ci, -ci], Vx <t < AJ- 

1 + Xi[h)mi{Xr) 

(2.19) 



Now we claim when c is sufficiently small, there exists 

Ar - Ao > 2c. (2.20) 

Otherwise, we can assume A,. — Aq < 2c for arbitrary small constant c thus Aq > 
Xr — 3c. Then by continuity we have 

l + Ai(i;)mi(Ao) = l/2(l + Ai(S)mi(Ar)), or mi(Ao) = -2ci or - ci 

If 1 + Ai(S)mi(Ao) = 1/2(1 + Ai(S)mi(Ar)), by using (l2T8]) we have 

c' < 1(1 + Ai(S)mi(Ao)) - (1 + Ai(S)mi(A,))| < C|l + Xi{^)mi{X^)\-^c 

for some Ag G [Aq, A,.]. However, by the definition of Aq, we get a contradiction if c 
is selected to be sufficiently small. If ?n,i(Ao) = — 2ci or — ci, we see 

\ci < \miiXr) - mi(Ao)| < C\l + Xiil2)mi{X^)\-'^c. 

Then by definition of Aq, we also get a contradiction when c is small enough. Thus 
we conclude the proof. D 

Now we come to prove Lemma 12.51 



Proof of Lemma \2.5[ We define the largest endpoint of the support of po smaller 
than Ar- by A,-. It suffices to show that there exists a sufficiently small constant c 
such that 

Xr - Xr- > 4c. (2.21) 

Note that by ()2.10p . we know when A is an endpoint of the support of po and A > C2 
with any fixed positive number C2, there must be 

In the sequel we assume A^- > C2 for some fixed positive number C2, otherwise ()2.2ip 
holds naturally. We already know that mi{Xr) is the unique solution of the equation 

+2^2 



/(TT^^''^^') = ^^ 
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in (— l/Ai(S), 0). Thus we have 

mi(Ar-) GM\(-1/Ai(S),0). (2.23) 

By (iii) of Condition II. H we have 

- 1/Ai($]) - c" < mi{Xr) < -jci < -c" (2.24) 

for some small positive constant c" . Here the upper bound in ()2.24p follows from 
(f236D . Hence, (l2:2H]l and (l2:2i]l imply 



\mi{Xr) — mi{Xr-)\ > c" 
Now use ()2.6p we have 

\{mi{\r) -mi{\r-))\ 



(2.25) 



Ar — Ar 



\mi{Xr)mi{Xr-)\ 



1 / t^mi{Xr)mi{Xr-)dHN{t) 
^ I {l+tmi{Xr)){l + tmi{Xr-)) 



(2.26) 



Note that from ()2.7p we have 

ini{Xr^),mi{Xr) ~ 1. 
Then by (|2.25p it suffices show that there exists some positive constant c'" such that 

t^mi{Xr)mi{Xr-)dHN{t) ^ ,„ 



1-d 



-1 



^ J {1 + tmi{Xr))il + tmiiXr-)) 
Now we come to verify ()2.27p . Note that by (|2.22p we have 

t'^mi{Xr)mi{Xr-)dH]\f{t) 



> c 



1-d 



d 



-1 

N 



^ I {l + tmi{Xr)){l + tmi{Xr-)) 

t^ml{Xr-)dHN{t) f fmi{Xr)mi{Xr-)dHN{t) 



d 



N 



(l + tmi(A^_))2 
mi{Xr-) — mi{Xr 



{l + tmi{Xr)){l+tmi{Xr-)) 
t^mj{Xr-)dHN{t) 



mi{Xr-) J (l + tmi(A,._))2(l + tmi(A,0) 

Moreover, by assumption we have 

1 + tmi{Xr) > c, Vt G 5pec(S). 

Combining these facts with ()2.25p and ()2.22p for ?7ii(Ar-), we obtain ()2.27p with 
some sufficiently small c'" > 0. Inserting ()2.25p and ()2.27p into ()2.26p we obtain 
that (|2.2ip holds with some positive constant c := c{c",c"'), which implies Lemma 
ESI □ 

With the aid of Lemma 12.11 now we can start to prove Lemma 12. 3[ 

Proof of Lemma \2.3[ Note that by definition, 
9mo(z) 



rj 



1 



(x - Ey + ry2 



dFo{x 
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thus the second inequahty of (hi) fohows dhectly. It suffices to verify (i), (ii), (iv) 
and the first inequahty of (iii) in the sequel. At first, we come to show (i). Note 
that when z G 5'(0), there is 

Ci<E< Cr- 

Then it is easy to obtain (i) by (|2.6|) . 

Now we come to verify (ii) . The proof of (ii) rehes on Lemma 12.11 By definition 
we have 

mo{z) = / dFo{x). 

J X- z 

At first we deal with the case of 

E>\r + r]. 
We do it as follows. 

9mo(z) = / FN2 , 2 ^^o(^) 

Jq [x - Ey + r/^ 

2 ^ po{x)dx + 0(77) 



Ci/2 [X-EY+T] 

—2- — :^po{x)dx+ / —2- — :^pQ{x)dx + 0{ri) 

Ci/2 {X - EY + 77^ ;;,,,_£ [X - EY + T]^ 

/■Ar „ 

^+l-.(v--)^+(-+.)^^^^^'" 



Now if c > K + ?7, we have 

(■K+ri 



l^rirw'^''' = I 't^+('+.)^^''+Lt^+('+.)^^'* 






Vi^ + v' 

If c < K + r], then we have 



I 



^ Vtdt ~ / - — ^^^—-rVidt ^ 



/o f^ + in + rjY Jo {K' + 'hY V^ + r/' 

Now we come to deal with the case of 

E E [Xr -C, Ar. +77). 

Note that our discussion for the case oi E > X^ + t] can be extended to the case of 
E > Xr- Actually, in the region E £ [Xr,Xr + f]], n <r]^ thus one has 

I — r- ~ V'^ + ^• 
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Therefore, it suffices to handle the case of E £ [A,. — c, A^). Note 

po{x)dx + 0(7?) 



Ci/2 {x - Ef + rj 



,2' 



Ar-2c ^ |.A,. ^ 

c,/2 {x-EY + .^P^^^^"^ + A„_,, (,-i,). + ,2/'o(x)dx + 0(,) 

By spHtting the integral region [A,. — 2c, Ar] into two parts by |x — i?| > k or |x — -Ej < 
K, it is easy to see 



9mo(^) = 0(a/k+~??). 

Thus we proved (ii). 

Now we start to show the first inequality of (iii) whose proof will also relies on 
Lemma l2. II Note that by discussions above we have 

9mo(2;) > / y^ 2 Po{x)dx > / —^ — i2Po{x)dx. 

Then it is obvious that 

Qmo{z) > Cr], (2.28) 

since 

Po{x) ~ \/Xr — X ~ 1, X G [Ar — 2c, Ar — c] . 

At the end, we come to prove (iv). At first, we claim that when c is small enough, 
there exists 

inf |l + imo(x)| > 1/2(1 + Ai(S)mo(Ar)), Vx G [A^ - c, C^]. (2.29) 

te[AM(S),Ai(S)] 

To see ()2.29p . we split the interval into (A^, Cr] and [Aj. — c, Aj.]. For the first case, it 
is not difficult to see mo{x) = mi(x) is negative and increasing, thus 

inf |l + tmo(x)| = inf (1 + tmoix)) 

ie[AM(S),Ai(S)] ' t6[AM(S),Ai(S)]' 

= 1 + Ai(i;)mo(x) > 1 + Ai(S)mo(Ar) (2.30) 

when X > Ar- For the second case, we recall Aq defined in (|2.19|) and the inequality 
(g^D]). Then it is obvious that 

inf |l + tm,o(x)| > inf (1 + tmi(x)) 

te[AM(S),Ai(S)] ' te[AM(S),Ai(S)]' 

= 1 + Ai(S)mi(x) > ^(1 + Ai(S)mo(Ar)) (2.31) 
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when X G [A.^ - c,\r\. Thus ([X^ follows. Moreover, (pTBH]) and ^^X^SY) also imply 
that 

inf (l + tmi(a;)) > -(l + Ai(S)mo(A^)), x G [A,. - c, C,.]. (2.32) 

te[AM{S),Ai(S)] 2 

Now we extend ()2.29p and ()2.32p from the real line to the full region Sr{c, 0). At 
first, we use an elementary inequality which can be found in ^ (see the proof of 
Lemma 6.10 therein), in which the following inequality is proved. 



\{m{z)x + 1 



,-1 



< max ( — , 2 1 , 

7? 



(2.33) 



where m[z) := m{E + irj) can be the Stieltjes transform of arbitrary probability 
measure and x can be any positive number. By (j2.33p . we know that 



Tf] 



inf ll + t?Ti-n(^)l > niin , , ,„, , 

te[AM(E),AUS)] ' °^ ^' - V4Ai(S)'2 

Therefore, it suffices to show (|2.4p when ry is sufficiently small. To this end, we will 
combine ()2.32p and a bound of derivative of mi{z) with respect to r] which can be 
obtained as follows. By definition, we have for any r/ > 0, 

(x — E)rj 



mi{z) 



x-E 
{x - EY + r?2 



dFo(x), dr^mi{z) 



((x-^)2 + r?2) 



2^2 



dFo(x). 



Now let a be a small positive constant. At first, we handle the case of -E G [A^— c, A^]. 
We split the estimation of drjmi{z) into two cases: k < 2r]^^'^ and k > 2?]^^°. Note 
that for the first case, we have 



\d^mi{z)\ 



< C 



+ 



{x — E)r] 



T] 



((X-S)2+J?2)2 



po{x)dx 



+ 0{r^) 



, 'Ci/2 



{E - xf 

-l/2-ba/2\ 



-dx + / r/" 



'" . r/-^ . r^^^~°')l'^dx + 0(7?) 



when a is chosen to be sufficiently small. 
When K > 2rf'~°' , we have 



\dr^mi{z)\ 



E+r)^ 



< 2 



E--q'^-°' f-Xr 

+ / + 

Ci/2 JE+r}^-" JE-q^-" 



{x — E)r] 



((x-^)2+r/2)2 



po{x)dx 



+ 0(7?) 



(x — E)r] 



E-r,^-<- {{x-EY+r] 



2^2 



po{x)dx 



+ 07]- 



-l+2a 



E+T]^ 



(x — E)r] 



^_,i-. ((x-E)2+r/2): 



:ipo{E) + Po{^{E,x}){x - E))dx 



+ C7] 



-l+2o 
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by the mean value theorem, where C{e,x} is some real number between E and x. 
Note the analyticity of poix) in its support has been proved in [32J. Moreover, we 
have 

which is implied by ()2.5p and Lemma [2. II Thus by the assumption of k > 2r/^~" we 
have 



\drjmi{z)\ 



< 



< 



< 




(x — E)TI 



-l+2a 



^3-2a . ^-4 



r,-^i^+^i^dx + r^+ 



2a 



when a is chosen to be sufficiently small. 

The case of E G (A^, Cr) is similar and simpler, thus we omit the details. Actually, 
for E G [Xr — c, Cr] , we have 

|5,mi(z)|<0(7r'+'") 

for all rj < fj with some some sufficiently small constant r}. 

Now we recall (|2.29|) and fix an ^ G [A^ — c, Cr]- Since mo{E + irf) is continuous 
for ?7 > 0, there must exists some small positive ?7o such that 

1 + tmi{E + i7?o) > ^(1 + tmi{E)) > ^(f + Ai(S)mi(A,)), t G [Am(S), Ai(S)]. 

Now consider E + i-q £ Sr{c,0). Note that we have mentioned above it suffices to 
consider the case when r] is sufficiently small. Observe that we have 



\mi{E + ir]) — mi{E + ir]o) 



dtmi{E + it)dt 



VQ 



<C 



t-^+^^dt 



m 



<Cr] 



2a 



Therefore, when rj is sufficiently small, we have 

inf (f + t{^)mi{E + if])) > c(l + Ai(E)mi(A,.)) > cq 

te[AM(S),Ai(S)] 

with some small positive constant c, cq. Therefore, we conclude the proof. 

3. Asymptotic analysis of miy{z) 



D 



In this section, we will prove the strong local MP-type law around A,-, see Theorem 
below. To this end, we will derive a self-consistent equation for rriN^z), which 
is quite close to that of mo{z) (see ()l.lip ). Then we can figure out the closeness of 
mj\f(z) and mo{z) through studying the stability of the self-consistent equation of 
mo{z). Such an approach is classical in the Random Matrix Theory, one can refer 
to [3] for instance. Actually, for our matrix model Wat, the concentrate inequality 
for m]\f{z) around mQ{z) was established in [2] for t] > 0{N^^''^). We also have 
mentioned in Introduction that in [T^ the local eigenvalue density problem was 
studied towards a special matrix model T^''^XX*T^'^ in the bulk case, where T is 
specified to be the inverse of another sample covariance matrix independent of XX* . 
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In [12], a weak local MP type law was obtained in the bulk case for T^''^XX*T^''^. 
The main proof route in this section is parallel to that in [3U]. However, owing to 
the generality of S, most parts require more general treatments. 

To simplify some discussions in the sequel, we will truncate and renormalize 
\/Nxij at first. Let Cg be some sufficiently large positive constant. We set 



ij — Xijlj. n^ . . .c' 



^ij 



{\VNx,A<i^ogNfor '1 ^VarixijY 
Now set X := {xij)M,N and X := {xij)M,N- Correspondingly, let 

Moreover, by (jl.4p we see 

Wn = Wn (3.1) 

with probability larger than 

l-0('iV2exp(-(logiV)^«^o)') >l-exp(-(logiV)2) 

if we choose Cq to be sufficiently large. Moreover, note that 

Xij = Xij + 0{exp{-{logNf^'o)). 

Thus by using basic perturbation theory of eigenvalues such as Weyl's inequality, 
we have 

max|Ai(>V7v) - Xi{WN)\ < iV^^^^ exp(-(logiV)^^o. (3.2) 

i 

Consequently, by ()3.ip and (|3.2p we can work on Wn instead of Wat. Moreover, by 
(|1.4p it is easy to check that Wn matches Wn to order k with any fixed k. That 
means, if W" matches WJ^ to order k, we also have W^ matches WJj^ to order k. 

For ease of presentation, we recycle the notations X and Wat to denote X and 
Wn, and we also denote the truncated and renormalized variable Xij by Xij in the 
sequel. Thus without loss of generality, we will assume 

max|\/iVxio| < (logiV)'^o. (3.3) 

Now we introduce some notations. We denote the Green functions of Wn and 
Wn respectively by 

Gn{z) := {Wn - z)-\ GnIz) = {Wn - z)"', 

where z G C^. For ease of presentation, when there is no confusion we will omit the 
subscript N or variable z from the above notations. Note that by definition we have 

mN{z) = —TrG{z), m^iz) = —Trg{z), 

and 

M - N 
TrG{z) - TrG{z) = . 

Furthermore, we use Xj to denote the i-th column of X, and introduce the notation 
X^"^' to denote the Mx (A^ — |T|) minor of X obtained by deleting Xj from X if i G T. 
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For convenience, we will briefly write ({«}) and ({i,j}) as (i) and (ij) respectively. 
Correspondingly, we denote 

and 

Gm(z) = (t^m - z)-\ g^'^Hz) = (w(^) - z)-\ 

In the sequel, we will keep the names of indices of X for X' ' . That means 






Correspondingly, we will denote (i,j)-th entry of G'^'^'^z) by G^- (z) for all i,j T. 
Similarly, we remind here the index {i,j) is not in the conventional sense. Note that 
by definition, G^ ' is an (A^ — |T|) x [N — |T|) matrix. However, here we use the 
index set {1, . . . ,iV} \ T instead of {1, . . . ,iV - |T|}. Set 

At first, we state the following Lemma which collects some basic formulas on the 
entries of Green functions. 

Lemma 3.1. Under the above notations, we have 
(z): 

1 



Giiiz) 



+ zr*g«(z)r/ 



(n): 
(in): 



G,j{z) = zGuiz)G^;hz)r*g('^Hz)rj, i + j. 



■<(k)(^^ , Gik{z)Gkj{z) 
Gkk{z) 



G.,{z) = Gl''>{z) + -''X',T\ hJ^k. 



Proof. One can refer to Lemma 2.3 of [30j or Lemma 3.2 of [22] for instance. Actually, 
if we regard r j as Xj in [30J , then G has the same structure as that in the null case 
in [30j, so does g. D 

By Lemma |3. II we see that 

TV N 

"^^(^) = ^ E Gu{z) = -T7 E ^ .ri^)i , ■ (3-4) 

i\ ^ JS ^ z + zr*g^^>{z)Yi 

2=1 2 = 1 * ^ ^ 

Now we write 

1 ^ 1 
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where 



Yi = zr*g'^\z)ri + ^rr(m^(z)S + I)-^ll. 



Observe that 

Below we wih use the decomposition 

where 

Ti := Tiiz) = r*g«(z)r, - lrrg«(z)S, 

C/i := U,iz) = lTrg«(z)S - ^rra(z)S, 

V := V{z) = ^Trg{z)^ - ^Tr(-zm7v(^)S - z/)-^S. (3.6) 

Note that by definition 

N N 

W = ^r,r*, g{z) = (Y^r^r* - zI)-\ 

i=l i=l 

Thus 

^-1 



Q{z) — {—zmN{z)I^ — zl) 

N 
-{—zm]y{z)T, — zI)" 



^Fjr* - {-zmN{z))T. 



i=l 



Q{z). 



Now using the Sherman-Morrison formula 



r*(C + rr*)-^ = ^-——v*C 






for any invertable matrix C, we obtain 

1 + r*g(^>{z)ri 
Therefore, we have 

TV 

-{-zm]\f{z)T, - ziy^ ^ rir*g{z) 

i=l 

N 



E n ^ *]^(^)( ^ ^ • ("^^(^)^ + /)-V,r*g«(.). (3.7) 

^ z{l +r*gW z Fj 



^-z(l+r*g»(z)r,) 
Taking (|3.4p into account we obtain 



-{—z'mi\f{z)T, — zl) {—zmi\i{z))TjQ{z) 
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_:^z(l+r*gW(z)r,) N 

Combining ()3.7p and p.Sp we obtain 

Giz) — {—zmN{z)T, — ziy^ 



(3.8) 



^ 1 



^ ,, ^ ,^.,), , , (m^(z)$] + /)-ir,r:g«(z)-l(m;v(^)S + /)-^Sg(z) 
:^ z(l + r*^W(2;)ri) [ N 

Therefore, we can further decompose V[z) defined in p.6p into four parts. 

V{z) := vi{z) + V2{z) + V3{z) + V4{z), 

where 

^i(^) '■= -N^^^^ r:g«S(mW(z)S + /)-ir,--rr(m«(z)S + /)-^Sg«S 



^^2(2) 



V3{z) 



V4{z) 



N 

1 

'iV 



Af 



^ G,i r*g»S(m,v(^)S + ly'r, - r*g«S(m»(z)S + I)-'v, 



i=l 

N 



1=1 

N 

i=l 



N 

1 

'iV 



1 



1 



^rr(m«(z)S + /)-isg«S - -rr(m«(z)S + I)-isgS 



-irr(mi;^(z)s + /)-isgs - ^rr(m7v(z)s + ly^T^Gi: 



1 



Moreover, we will denote 

vii := vu{z) = r*g«S(m»(z)S + I)-\, - ^Tr{m^S{z)i: + /)-isg»S, (3.9) 



N 



thus 



Vl 



1 ^ 



iVli. 



i=l 



In the sequel, we will also encounter the quantity 



Riiz) := r*g«S(m;;)(z)S + /)-iS(mo(z)S + /)-iri 



-^rrg»S(m^)(z)S + /)-iS(mo(2)S + ly^^. 



Observe that if we can show that Yi's are small enough, (|3.5p turns out to be close 
to (|l.lip . Roughly speaking, our main task in the sequel is to bound the quantities 

\Y,{z)\, i = l,...,N. 

in some region we are interested in. Then we will take a step further to figure out the 
closeness of mN{z) and mo{z). Specifically, We will provide bounds for the following 
quantities 

Ad{z) := max\Gii{z) - mQ{z)\, Ao{z) := max\Gij{z)\, A{z) := \mN{z) - mo{z)\ 
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for all z G Sr{c,C) with some positive constant C. Our target in this section is to 
prove the following theorem. 

Theorem 3.2 (Strong local MP type law around Aj.). Under Condition IJ.il and 
i3.3\) . for any C > there exists some constant C^ such that 

n (^(^)^^''^]^} (3.10) 



ze5r(c,5Cj) 

holds with (^-high probability, and 
(n): 



Cc i ^^o{z) , 1 

2e5r{c,5C<;) I 

holds with C-high probability. 



n ^A„,.)+M.)s.Mj^.^ (3.11) 



Roughly speaking, in this part, we will adopt the proof route of that for Theorem 
3.1 of [3U]. At first, we will provide a weak bound for the strong local MP type law 
around the right edge. And then we use the weak bound to get the strong bound. As 
mentioned in Introduction, such a bootstrap strategy was developed in series works 
[201 ESI [m [3D] . However, since the generality of our setting on S, most parts of the 
proof need new techniques thus the details are relatively different from those of the 
null case in [30] . Especially, in the discussion of the stability of the self-consistent 
equation of 7710(2;), we will extend an idea from [12] to our case. 

3.1. Weak local MP type law around Xr. In this subsection, we will prove the 
following weak local MP type law in the region around A,.. 

Theorem 3.3. Under Condition \1.1\ and i3.3\) . for any C > 0, there exists some 
positive constant C(^ such that the event 

holds with Q-high probability. 

At first, it follows from the definitions that 

A{z) < Ad{z) < max \Gii{z) - mN{z)\ + A{z). (3.12) 

i 

For z £ Sr{c, 5C,^) and positive number K we define the event 

n{z,K) 

max < Ao(z), max \Gii{z) — mAr(2;)|, max |Tj(z)|, max |i?j(z)|, max |f ij(z)| > > K^{z) 



(jimaxi max\Ui{z)\,\v2iz)\,\v3{z)\,\v^{z)\\ > K^^^ 
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with 



Moreover, we set the events 

E{z) := {Ao{z) + Ad{z) > {logN)-^}, 
T{z,K):=n{z,KyuE{z). (3.13) 

In the sequel, we will frequently use the following large deviation estimates whose 
proof can be found in |21| . 

Lemma 3.4. Let Xi,Xj,i ^ j be two columns of the matrix X satisfying (ii) of 
Condition \1.1{ Then for any M x 1 vector b and M x M matrix C independent of 
Xj and Xj, the following three inequalities hold with Q-high probability 



(U): 



|x*Cx,--TrC|<^||C||H5. 



|x,Cxj| < —\\C\\hs- 



b* I ^^ ' I I !_ I 

X,- < ^^ b 



(Hi) 

Here r := t{tq) > \ is some positive constant. 

Proof. See Appendix B of [21J for instance. D 



To prove Theorem 13.31 we will provide the desired bound for the case of r/ ~ 1 
at first. Then we extend it to the full region Sr{c,bC(^). To fulfill the first step, by 
()3.12p it suffices to prove the following two lemmas. 

Lemma 3.5. Under Condition \l.l\ and ^3.3) . for any C > 0, there exists a constant 
Cq such that the event 

^e5r(c,5C<;),r?~l 
holds with C,-high probability. 

Lemma 3.6. Under Condition \l.l\ and h3.3\) . for any C > there exists a constant 
Cq such that 

holds with C-high probability. 
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At first, we need the following bounds on the elements of Green functions to verify 
Lemma 13.51 and [51 



Lemma 3.7. When z G 5.^(5, SC^;) with 77 ~ 1 , for any T C {!,..., A^} with 
\T\ = 0(1) we have 



with Q-high probability. 

Proof. By spectral decomposition, we have 

Af-|T| 



(3.14) 



Gf (-) = E 



1 



|Ufe. 



;(Tym)|2. 



f^ Xk(wm)-z^ 

Here Afc(VF'-'''')) is the k-th. largest eigenvalue of W^'^' by the notation in (|1.2|) and 

Ufc(T^m) := (u,i(VFm), . . . , Ufc,jv-|T|(VFm)) 

is its corresponding unit eigenvector. Now similar to ()2.2p . we see Ai(T^^^) is 
bounded with (" high probability, thus when z G ^^(c, SC^;) with ry ~ 1 we have 



.(T) 



G\:'{z)\>^G\:'{z) 



(T), 



V 



(XiiWm) - Ef + 7]'^ 



■Y.\u,,{W(^))\'>C-' 



for some positive constant C with (^-high probability. It is similar to show that 
rriAT ~ 1 with (^-high probability. Moreover, we also have the definite upper bound 



|Gl?(^) 



Af-|T| 



E 



^ \kiwm)-z 



Ufc 



:(H^(^))ufc,-(VFm) 



<'-<C 



with some positive constant C. And we also have 
1 1 *^ 1 

Thus we conclude the proof. 

Now we come to verify Lemma 13.51 



D 



Proof of Lemma \3. 51 At first, we claim it suffices to show that for any fixed z £ 
Sr{c, 5C^) with r/ ~ 1, there is some positive constant C^ independent of z such that 

max < Ao(2;),max |Gij(2;) — mAf(^)|,max |ri(2;)|,max |i?j(2;)|,max |fij(z)| > < (f ^^(z) 



(3.15) 



and 



max <^ max\Ui(z)\,\v2{z)\,\v3{z)\,\v4{z)\ } < ip^i^'{z) 



(3.16) 
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hold with (^-high probabiUty. At first, by (|2.33p we have 

||(mg(z)S + /)-^||op,||mjv(^)S + /||op<max n'^''°P ,2J ~1 (3.17) 

when r/ ~ 1. Moreover, we have aheady truncated the variables \/Nxij at (log A^)*-^o. 
Then it is easy to check the derivatives of the quantities Gtj (z) ,Ti(z) ,Ui{z) , Ri{z), 
vii{z), V2{z), vs{z), V4{z) with respect to z are all bounded by 'r]~^° in magnitude 
with some positive constant Aq. Now we can assign an e-net on the region Sr{c, 5Cq) 
with e = N-'^ooAo (gay) . Then it suffices to show (j3.15p and (j3.16p for all z in this 
e-net. By the definition of ^-high probability, it suffices to prove (j3.15p and (|3.16p 
for any fixed z. 

Note that when r/ ~ 1, by using ()3.14p we have 

G,,{z) ~ 1, mN{z) ~ 1 (3.18) 

with ^-high probability. Using Lemma 12.31 and (j3.18p we have 

^(z) = 0(iV~^/2), r/~l. 

By the definition of 0,(z,K), we should bound the following quantities 

Ao(2;), max \Gii{z) — mj^{z)\, max |Tj(2;)|, max |i?j(2;)|, 

i i i 

max\vii{z)\,max\Ui{z)\,\v2{z)\,\v3{z)\,\v4{z)\ 

when r/ ~ 1 one by one. We do it as follows. 
At first, we remind here the basic inequality 

\\AB\\hs<M\op\\B\\hs (3.19) 

for any two matrices A and B. Moreover, we have the following basic bound 

TrG{z) - TrG^'\z) < Oir]-^). (3.20) 

To see (|3.2Up . we denote the ESD of M^jy by Fji^ . Then by Cauchy interlacing 
property we know 

Xn{Wn) < A7v-i(t^f ) < • • • < X2{Wn) < Ai(Vrf ) < Xi{Wn), 
which implies 

sup|F^(x)-F«(x)|<^. 
Therefore, 

\TrG{z)-TrG('\z)\ < AT /" ^_d(F^(A) - F«(A)) 

J {■^ — z\ 

By using formula (ii) of Lemma 13. 1| Lemma 13.41 and Lemma 13. 7t we can get that 
with C-high probability, 

\Ao{z)\ < Cmax\r*g^'^\z)rj\ < max^\\^^/^g'^'^\z)^^/^\\HS 
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, c^_„e<..,.)||„,.c.c<..j5I^ 



C^c, „„ hTrCp^ I ^ c^c hH^lM + o(l). 

^ i^j Y iV^?? ^Af^ ^ y Nrj ^N^ 



Here in the last step we have used the fact that 7/ ~ 1 and 

TrG{z) - TrG^'^\z) < TrG{z) - TrG^'\z) + TrG^'\z) - TrG^'^\z] 

Now by (j3.22|) and the fact that r/ ~ 1 and thus 

QmN{z) ~ 1 
with (^-high probabihty, we have 

\Ao{z)\<G^maK\\g^'^\z)\\HS<0{ip''^^{z)) 

with (^-high probabihty. Similarly, we also have 



(3.22) 



(3.23) 



^ 



^^ max||g«(z)||H5<0(95''^^(^)) 

iV 4 



with ^-high probability. 

For Ad, we start from the observation 



max I Gii — mN \ < max | Gu — Gjj \ , 

i i^j 



(3.24) 



while 



\Gu — G 



jj\ 



-z — zr; 



*a«i 



-z — zr 



*a(j')i 



< 



where T,- is defined in 



GiiGjjl (\Ti -Tj\ + ^iTrgWs - Trg^^^E\] , 
. Note that by using Lemma 13.41 again, we have 



-(^cL\\G(^)(z)yMzjo < Gcn^C — 



:^)i 



\Uz)\ < ^^c_||gW(^)s||^5 < C^^C-\\g(^>{z)\\HS = 0{^^^^{z)) 



(3.25) 



with (^-high probability. Here the last step can be obtained by a similar calculation 
as that for (j3.23p . Moreover, we have 



l|rrg«(z)S - Trg^^\z)^\ < |t/,(z)| + \Uj{z)\ 



(3.26) 



Therefore, we come to estimate Ui{z) below. Now using the Sherman-Morrison 
formula 



{A + r,r*)-' = A-'-j 



+ r*A-^r, 



(3.27) 
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for any M x M invertible matrix A, we have 



mz)\ = l|rr(g«(z)-g(.))s| = l 



r*gWsg«ri 



1 + r*g(i)ri 



1 



IzGiiiz) 



r;g«Sg«r, 



< C 



1 

N 



TrQ'^^T.g^^T. 



+ ^||gWsg^*^S||^5 



N 



with C-high probabihty, where the last step above follows from Lemma 
Observe that 



rrg(^)sg(*)s < rrg«s2(gW)* < C\\Q^^\ 



HS- 



and 



|g«Sg»E||^5 < C||(g«)2||^5 < C||g«||^5. 



(3.28) 



Therefore we have 

with C-high probability. Thus l!^7M\t - ^^7IE\t imply that 

max\Gii{z) - mN{z)\ =0{ip^i^{z)) 

i 

with (^-high probability. 

Hence, it remains to bound 

Ri{z), vii{z), i = l,---,N, and Vkiz), A; = 2, 3, 4. 

To bound these quantities, we recall (|3.17p . For vii{z), we use Lemma [331 again, 
thus we have 

max|t;ii(z)| < max\r*g^''^T.{m'^!}{z)^ + ly^rt - —Tr{m^^ {z)^ + ly'^T.g^'^z)^ 

it N 









I N 

with (^-high probability, where the last inequality above follows from (|3.17|) . Similarly 
we can get that 

\R,{z)\=0{^''^-^{z)) 

with ^-high probability. 

For V2{z)^ we have 



\v2{z)\ < C max r*e«(z)S r(mjv(2)S + /)-!- (mS;^(z)E + /)-i 



r.: 



max 



r:g«(z)S(m^(z)S + I)-\mN{z) - m'^;> {z))J:{m%\z)^ + I)-' 



< max\mN{z)-m%>{z)\( Trg^^{z)T.{mN{z)^ + I)~^^{m}S {z)^ + I^^T. 



iv/^W/ 
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< Cmax|m^(z)-m»(z)| (^lrr|g«(z)| + ^||g«(z)||H5) 

< C\Ao{z)\^ = 0{y^^c^\z)). (3.29) 
with (^-high probability. Here in the last inequality above we have used the fact that 

|m^(z)-mg(z)|=0((Ao(z))2) 

which is implied by (|3.14p and (iii) of Lemma 13. 1[ 

For V3{z), we can use the same approach as what we have used to bound Ui{z) in 
(I32S]). Actually by using ([X^ and (fXTil) again, we have 



b3(-z)| < Cmax 



Irr (g»(z) - g{z)) s(m«(z)s + /)-is 



Cmax — \zGii{z) 

i iV 



^rr(m;;^(z)S + I)-isg(z)S - lrr(m^(z)S + I)-^^G{z)T. 



< Cv^^c max^||g«|||,5 = (^^c max^^ = 0{^<=^^^'{z)){3.30) 

Moreover, the estimate of V4{z) is similar to that of V2{z). Actually, we have 

I ^4 (-2) I < max 

i i\ 

< Cmax\mN{z)-m^^\z)\ = 0{AI) = Oi^^^^^^iz)). 

i 

with C-high probability. Thus we conclude the proof of Lemma I3.5[ D 

With the aid of Lemma 13.51 we can prove Lemma 13.61 below. 

Proof of Lemma \3.6[ Similar to the discussion in the proof of Lemma 13.51 it suffices 
to estimate A(z) for a fixed z. At first, we pursue the idea in [30] to introduce the 
function 

V{u){z) = ({u(z))-' -d],^ J ' dHNit) 



(<"-»<'-»" -*7wi)TT'"'"<'> 



Note that by the fact 

N 

Y,{Gii{z)-mN{z))=0, 



i=l 

it is not difficult to see that 

N 



L^{iGuiz)r'-imNiz)r'] 



N 

i=l 
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1 ^ mNJz) - Gii{z) ( l_ ^ {Gii{z) - mN{z))'^ 



N 



i=l 



o(^^ 



mjf{z) 

{Gii{z) -mN{z)f 






iV^ G,,{z)ml{z) 
with (^-high probability by using Lemma 13.51 and Lemma 13.71 By the fact that 

(G,,(--))-' = - + <;-/ —L—^iH,(t) - Y, 

(fOTI) and (fLTTD we have 

\V{mN{z))\ <\[Y]\+ip^<^>^{z) 
with ^-high probabihty, where 

N 



(3.31) 



[^] = ^^^^- 



i=l 



Therefore by the definition oiT){u{z)) and the bounds for \Yi\^ we know that in the 
case of ry ~ 1, there exists 



-1 



((mAr(z)) ^-{moiz)) ^) + dj^ 



t 



t 



dHN{t) 



< nf'- 



1 



tmo{z) + 1 tmN{z) + 1 

with ^-high probabihty. Taking the fact that mQ{z),mi\i{z) ~ 1 into account we 
obtain 

t^mN{z)mQ{z) 



{Nriyf^ 



{mo{z) -niNiz)) 
with some function 



^ " ^^' / {tmj,{z) + mZ{z) + l]'^^''^'^ 



Cc 



Mz)\<v''' 



1 



(A^r/)V2 

with (^-high probabihty. Now we need to estimate 

_i f t^mN{z)mo{z) 



Hz) (3.32) 
(3.33) 



l-d 



^ J {tmNiz) + l){tmo{z) + l 

when ry ~ 1. 

Note that by Cauchy Schwarz inequahty, we have 

t'^mNiz)'mQ(z)dHN{t) 



-dHN{t) 



-1 

N 



< 



dj^H^\mNiz)\' 
\tmN{z) + 1|2 

Now by (jl.lip we have 



(tmNiz) + l){tmo{z) + 1) 

-U2|_..,M2 \l/2 



dHN{t) 



(iJv^tVo(z)l^ 



dHNit)] 



1/2 



1 + zmo{z) - d^i / /7|^| rfgAr(t) = 0, 
J tmo{z) + 1 



mo{z) + z\mo{z)\'^ - dj^^\mo{z)\'^ / ^_ ^^^ , ^ dHN{t) = 0. 
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we have 

t 
tmo{z) + 1 

Taking the imaginary part of the above equation we obtain 

-Qmoiz) + ry|mo(^)P + dj/\moiz)\^ [ |/'^T11i2 ^^^(^) = «' 

J \tmo[z) + l\^ 

which imphes 

< ^:v7 /'"f >I> H.(«) - 1 - ig^. < 1 - ^ (3^34) 

for some positive constant 5. Here we have used the fact that r/ ~ 1 and (i) of 
Lemma 12.31 Similarly, by the facts 

(mNiz))-^ - d^^ J -—^-—^dHN{t) + z = 5o 

and mN{z) ~ 1 with ^-high probability when r/ ~ 1, we can obtain 

'-"^^ J it^^M^yrw ""^^ = " 9^^^^^^' " ^^°^'^^ ^ ' 

with C-high probability, where 6q{z) is some function such that |(Jq(z)| < tp'-'i /{Nt])^''^. 
Here we have used the fact that r/ ~ 1 and the bound (|3.33p . 
Therefore we have 

t'^mNiz)mQ{z) 
{tmNiz) + l){tmo{z) + 1) 

for some positive constant cq. Hence, by ()3.32p . (|3.33p and ()3.35p we have 

Thus we complete the proof. D 

Now ,we consider to extend the results to the case of r/ <C 1. To this end, we will 
provide the desired bounds ( (j3.15p and (j3.16p ) with the condition that event H'^(z) 
happens at first, then we prove event H'^(z) holds with ^-high probability. Such an 
strategy is also parallel to that of the null case in [30] . Similar to Lemma 13.51 we 
now need the following lemma for the first step. 

Lemma 3.8. Under Condition \l.l\ and i3.3\) . for any C > 0, there exists a positive 
constant Cq such that 

2e5,.(c,5Q) 
with C,-high probability. 

To prove Lemma |3. 81 we will need the following lemma. 



1 - ^]vW ^_ iT^\'r':z. . .. dH^jt) 



> Co (3.35) 
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Lemma 3.9. For z E Sr{c,5Ci^), when the event E^{z) happens , for any Glj {z) 
with |T| = 0(1) and i,j T we have 

max|Gf - Gu\ = 0{Al), gJP ~ 1, A^ := max |G;P(z)| < CAo, (3.36) 

with Q-high probability. 

Proof. Note that when the event S'^(z) happens, we have 

Gii'^1, Ao{z)<{\ogN)-\ 

Now by formula (iii) of Lemma [3. II and induction, we can easily conclude the proof. 

D 

Now we come to show Lemma 13.81 



Proof of Lemma \3.^ At first, we use the discussion in the proof of Lemma [3.51 again 
to claim that it suffices to prove the result for any fixed z € 5r(c, 5C^). 

By the definition in ()3.13p . it suffices to show that (|3.15p and ()3.16p hold with 
^-high probability in '^.^{z). That means, we need to go back to the proof process 
of Lemma 13.51 But this time we have the condition that ^^{z) happens instead of 
ry ~ 1. Note that by definition, in ^^{z) we have 

A{z)<Ad{z)<{\ogNr\ 

Using (i) of Lemma 12.31 we have 

m7v(^)~l, Gii{z)^l, inH^(z). (3.37) 

Note that in the proof o Lemma 13.51 every term except for V2{z) and ^4(2) can 
finally be bounded by some quantity in terms of maxj^j | \Q^^^' \\hs or maxj \\Q^'^' \ \hs- 
Therefore, at first we come to bound ||^^*''^||hs' and II^^^^Uji/^. Note that 






^ .^ + 0(^) + 0(^), (3.38) 

y r/ rj 

where the last step follows from ()3.36p . Now similar to ()3.23p , with the aid of p.37p , 
we see in E^{z), there exists 



„<«,,.)„„., c.c.y«",o(|,.o,l 



Ao(z) < C— -— maxl.^ x-nm^ "T 1/ ,T i -\,T / i "\,r; 



Cc I'^rnNiz) , ^, 1 



where we have used the fact that z £ Sr{c,5G(^). Thus by (iii) of Lemma 12.31 we 
have 

Aoiz) <ip^i^iz) (3.39) 
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with ^-high probability in ^^(z). Inserting (|3.39p into (|3.38|) . we have 

^\\G^''\z)\\hs<'^{z) (3.40) 

with C-high probabihty in H^(z). Analogously, we also have 

^\\g'-'\z)\\HS<^{z) (3.41) 

with C-high probability in ^^(z) 

With the aid of (j3.37p , ()3.40p and ()3.4ip it is not difficult to see (by using Lemma 
13. 9p the estimates between ()3.23p and ()3.28p are still valid in H^(z) for all z G 
Sr{c, 5C^). 

Therefore, it remains to estimate Ri, vu and V2,V3,Vi. Actually, we need the 
following lemma. 

Lemma 3.10. Under Condition \1.1\ for any C > 0, there exists a constant C^ > 0, 
when z £ Sr{c,5C^), we have with Q-high probability, 

\\{m'§{z)^ + I)-%p,\\{mN{z)J: + I)-%p,\\{mo{z)J: + I)-%p < C^, in H'^(z). 



(3.42) 



Moreover, we have with Q-high probability, 

^Tr\g^^{z)\ < {logNf^^\for all z e Sr{c,5C^). (3.43) 

Note that by the estimates for Ri , vu and f 2 , ws , ^4 in the proof of Lemma 13.51 
we can easily see that once Lemma 13.101 holds. Lemma 13.81 follows. Actually, with 
the aid of Lemma 13.101 we can easily check that 

\v2\,\v3\,\vi\<Oi^^(^''), \Ri\,\vu\<Oiip^''^^), k = l,...,N (3.44) 

with ^-high probability in H'^(z). 

Hence, it suffices to prove Lemma l3. 101 below. To this end, we need the following 
crude bound on the counting function of eigenvalues. Hereafter, we will use Ni{A) 
to denote the number of the eigenvalues of an Hermitian matrix A in the interval /. 

Lemma 3.11. Under Condition \1.1[ for any (" > there exists some constant 
C^ > such that for any interval I C [Ci, 00) with length \I\ > ip '- /N , we have 

Ni{W) < C(;N\I\ (3.45) 

with (^-high probability. 

Proof Let rj > ip^^< /N. Note that for any interval I =[E- 77/2, E + rj/2] C [Q, 00] 
(thus E — r]/2 > Ci) we have the elementary inequality 

Ni{W) < CNri'^ruN (z), z = E + ir], 

thus 

N 

Ni{W)<CriY,'^Gii{z). 

i=l 
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Now we assume Nj{yV) > C^Nrj for any large constant Cq to get a contradiction. 
It suffices to show for any C > 0, there exists a positive constant C[ such that 

|9Gi,(z)| < C'^ (3.46) 

with ("-high probabihty if Ni{W) > C(^Nr]. To verify ()3.46p under the assumption 
of Njiys^) > C(^Nr], we rewrite (i) of Lemma l3. II as 

a.. = I 

where Ai(>V(*)) > A2(W(*)) > ... > AAf(>V(*)) are the eigenvalues of W^ and 
^k{y^ ): k = 1 . . . , M are their corresponding unit eigenvectors. Then we have 



< ^ (3 471 

with i^-high probability. Now by Cauchy interlacing property again we know for any 
interval I there exists 

\Nj{Wn) - Nj{W^i^)\ <1. 

Thus by assumption, we have 

\Ni{W^^^)\ = nk ■■ |Afc(W«) -E\< rj/2} > C^Nr] 
with some sufficiently large C^. Now we set the projection matrix 

Pz := Yl Ufe(W«)K(W«))*. 

k:\\UW<--'))-E\<ri/2 

By assumption we have 

TrP, = TrP^ > C^Nr]. 
Thus by Lemma [331 there exists 

Y, (ufc(w«),r,)|2 = x;si/2p,si/2x, 

fc:|Afc(WW)_S|<»?/2 

with C-high probability for some positive constant C'/. Thus by (j3.47p we see that 
()3.46p holds with ^-high probability. Therefore we conclude the proof. D 

Now we come to prove Lemma 13.101 

Proof of Lemma \3.1(K At first, we come to show p.43p . By definition we have 



N 



Tr\Q^Hz)\ = ^Y: 



N f^^\\k{W^^) - z\ 
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Recall the fact that for any interval I C M, 

|iV/(W«)-7V7(W)| < 1. (3.48) 

Note that when z = E + irj € Sr{c, 5C^) one has rj > ip < /N . Now we split R 
into 

M = (-00, A^ - 2c) U IJ 4 U iCr,oo) := /o U Ml 4 U Ik„+i, 

where K^ = 0{rj^^) and 4 are non-intersecting intervals with |Ifc| = r],k = 1, . . . , K^. 
Specifically, 

4 := [A.r -2c, Ar -2c + ?7). 

Now we write 

, -I K-a + l 



Then for z = E + irj €z Sric, 5C^), by invoking Lemma l3 . 1 1 1 and ()3.48p we have 

—Tr\g^'\z)\ <log^WiV 

with ^-high probability. 

Now we start to show ()3.42p . Note that in H'^(z), we have 

\mN{z) - moiz)\,\m%'{z) - moiz)\ < (logiV)"^ 

Therefore, it suffices to show that 

\\{mo{z)^ + I)-%p = 0{l),ioT z G Sr{c,5C^), (3.49) 

which follows from (iv) of Lemma 12.31 immediately. D 

Therefore, we conclude the proof of Lemma 13.81 D 

Moreover, by definitions we also have the following lemma. 

Lemma 3.12. Let K he some positive number such that 1 < K '^ (iVr/)^'^. Then 
for z G Sr{c^ 5C(), in V{z, K) we have 

\V{mN){z)\ < \[Y]\+0{KH\z)) + ^l^^,y (3.50) 

Proof. Note that by the assumption on i^, we have 

K^ < 1, in --"{z). 

Hence, by the fact that Gu{z),mi<[{z) ~ 1 in H'^(z) it is not difficult to see 

N 

E(Gjj(z))~^ = (mAr(z))~^ +0(max|Gii(z) -mAr(2;) 
7 



i 
i=l 



{mN{z))-' + 0{K'^\z)), in Q.%z, K) n H^(z) 
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from (j3.3ip . Moreover, we recall the identity 



{Gu{z))-' = -z + d^i / ——^-^—^dHN{t) - Yi, 



t 
trriN^z) + 1 
which implies 

(m^(z))-i = -z + d^i j -—-L—dHN{t) - [Y] + 0{K^^>\z)), in ^%z, K) n ~%z). 

Then by the definition of T>{u{z)) we obtain 

\V{mN{z))\ <\[Y]\+0{KH^{z)), \n^^z,K)r\'~%z), 
which implies (|3.50p in T{z,K). D 

Now we are on the stage to extend the estimate on A(z) to the case of r/ <^ 1. 
Actually, we have the following crucial Lemma which is an extension of Lemma 6.12 
of |30] to our non-null case. 



Lemma 3.13. Let K = ip^^"^' and L = 0(1) be two positive numbers satisfying 

^^ >K'^{\ogNf. 
And let S be an event satisfying 

sc n r(.,i^)n n ^'(^)- 

z&Sr{c,L) z&Sr{c,L),ri=l 

Assume that in S one has 

\'D{miy{z))\ < 5{z) + ool2(2), for all z G Sr{c, L), 

where 5 : C — )• M^ := {x G M : x > 0} is a continuous function. Moreover, 5{z) is 
decreasing in t] and \6{z)\ < (logA^)^*^. Then there exists some positive constant C 
such that 

A{z) < C(\ogN)^ ^^^ for all z £ Sr{c,L) (3.51) 

-S/K + T] + 6{z) 

holds in S and 

SC n ^'(^)- (3-52) 

zeSr{c,L) 

Proof. Note that we need to prove ()3.5ip and ()3.52p . By definition of H(z) it suffices 
to prove that ()3.5ip holds when 

Ao{z)+Ad{z)<{logN)-^ (3.53) 

and then prove (j3.53p holds for all z E Sr{c,L) when the event S occurs. To this 
end, we define the set of r] 

Ie := {t] : K{E + ifi) + Ad{E + ifj) < {log N)-\yfi > r],E + ifj £ Sr{c, L)} 

At first, we come to show that (|3.5ip holds for z = E + irj with r] £ Ie- By 
assumption, we have 

\V{mN{E + ir]))\<6{E + ir]), ^r/ £ Ie- 
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Note that when -q € Ie, we have 

mN{z) = mo{z) + 0{{logN)-^)r^l (3.54) 

Thus similar to (|3.32p . by the definition of 'D{u){z) here we also have 

t'^mN{z)mo{z) 
{tmN{z) + l){tmo{z) + 1) 



1 - ^^W .... rT^'r.Z':^ , .. dHj,{t) 



Soiz), (3.55) 



(7710(2;) -niNiz)) 

with 

\6oiz)\<0{5{z)), t^gIe- 

Similarly, in order to obtain a bound for A, we need to derive an estimate towards 
the quantity 

However, unlike (|3.35p whose estimation is based on the fact of ry ~ 1, we need to 
do it for general ij this time. Thus more delicate calculation should be taken. 

At first, by the discussion towards ()3.32p . we see ()3.5ip holds naturally when 
7/ ~ 1. Therefore, in the sequel, we will assume rj < t) for some sufficiently small 
constant 77. Now we can write ()3.55p as 

imo{z) -mN{z)) 

= 5q{z). (3.56) 

By (i) and (iv) of Lemma 12.31 and p.54p , we have 

I I U rTXTT7rVTXn2^^iv(t)| < Cfor z G 5.(c,L). (3.57) 

J {tmN[z) + l){tmQ[z) + lY 

for some positive constant C. Now we come to show that when r] < r] and c is chosen 
to be sufficiently small, we also have 

i /tT f^fut\ ,^u2 ^H^^^)\ > C-\ioT z e Sr{c,L) (3.58) 

for some sufficiently large positive constant C. Note that by using (i) and (iv) of 
Lemma 12.31 and ()3.54p again, it is easy to see 



dHNit)- / -——-:.dHN{t)\=0{{logN)-') 



7 itmN{z) + l){tmo{z) + l)'^^^^"'"' J (tmo(z) + l)3" 

when r] £ Ie- Thus it suffices to show that when rj < fj and c is chosen to be 
sufficiently small, there exists 

I [7-^^r-TTsdHN{t)\>C-\foTzeSr{c,L) 
J (tmo{z) + 1)'^ 

For convenience we set 

J{t) := tmoiz) + L 



40 ZHIGANG BAG, GUANGMING PAN, AND WANG ZHOU 

Note that by (iv) of Lemma [231 we know both ?R-J^{t) and 9i7(t) are positive when 

z S Sr{c,L). Let 

9{t):=aTg{Jit))eiO,7r/2). 

It is not difficult to see from the proof of (iv) of Lemma 12.31 that if c is chosen to 
be sufficiently small, we can guarantee that the parameter cq therein is much larger 
than c. Then by (ii) of Lemma 12.31 when rj > fj we also have 



Qmo{z) < C\Jc^f]. 



Therefore, when 17 and c are sufficiently small, we have that ^J{t) is much larger 
than 9v7(t) such that 9{t) < 7r/18(say). That means we have 

< (0(t))3 < 7r/6, thus ^{J{t)f>^\J{t)f 

when z £ Sr{c, L). Then it is easy to see 

J {tmo{z) + l)'^ J {tmo{z) + l)'^ 

for some positive constant C. Then (|3.57p and (|3.58p together imply that 

. . ° dHN{t) ~ 1, 2 G Sr{c, L), rjelE, V < V 

{tmN{z) + l)(tmo(2;) + 1)^ 

when 77 and c are sufficiently small. Therefore, by ()3.56p we have 

a{z){moiz) - mN{z)f + (1 - h{z)){mQ{z) - mN{z)) = 5q{z), (3.59) 

where 

Now we need to provide an upper bound on 

|1-6(.)|. 
Recall the formula in ()3.34p and set 

^^ •= '^N / I, ' ° /il2 ^^^W = 1 - c^ \ ^ < 1- 3.60 

J \t'mQ[z) + l\^ ^rno{z) 

Note that since 1 > c(z) > |&(z)|, it is obvious that 

|1 - b{z)\ > max{l - c{z), \c{z) - b{z)\}. (3.61) 

Thus it suffices to estimate the quantity 

\c{z)-b{z)\. 

To this end, we learn an idea in [12j which is provided for studying the stability 
of the Stieltjes transform of Wat type matrices in the bulk case. Actually, we can 
extend the discussion in [12] to the edge case for our model as well. For ease of 
presentation, we set 

^ tmo{z) 
tmo{z) + 1 
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Then 



Hz)-b{z)\ 



''N 



{\T{t)\^-T\t))dHM{t) 
= d^^ 2 l\'^F{t)fdHN{t)-2i I ^F{t)'^F{t)dHN{t) 

> 2d],^ f{'^F{t)fdHN{t) 

> C{Qmo{z)f [ \Fit)\^dHN{t). 



Qmo{z) 



Here we have used the relation 

^F{t) 

\F{t)\ ~ \tmo{z) + l\\mo{z)\' 
and the fact that 

"^o(-2) ~ li 1 + tmo{z) ~ 1, for z G Sr{c, 0), t E S'pec(S). 
Moreover, by definition and (|3.63|) we also have 

\Fit)\^dHNit) ~ 1. 



(3.62) 
(3.63) 



(3.64) 



Therefore, if 9?n,o(z) ~ 1, we have 

\c{z) — b{z)\ ~ ^inQ{z). 

If Qmo{z) < e for some sufficiently small constant e > 0, we need to estimate 
\c{z) —6(^)1 as follows. Set 

^(t) = e^'^W|^(t)|. 

From (j3.62p we see QF{t) > 0, thus we have G (0,7r). By the assumption of 
Qmo{z) < e, (j3.62p and (j3.63p we obtain 

sin (j) < Ce, 

thus 

I cos 01 > 1/2 

for sufficiently small e. Moreover, by (iv) of Lemma l2.3l and continuity, we have either 
cos(j) > 1/2 or cosi?i> < —1/2 holds uniformly on t € [Am(S), Ai(S)]. Therefore, we 



have 








\c{z) - 


-b{z)\ = djl 


j\F{t)\^-F\t)dHN{t) 






= 2d^^ 


f \F{t)\'^e'^ sin (f)dHN{t) 






> 2d^i 


/ J^(t) p cos sin (/xii^AT 


it) 



2d]^' / \F{t)\^sm^(j)dHNit) 



> d^^ j \F{t)\^{sin(t)-2sin^(t))dHN{t) 
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> {2dNy^ / \T{t)\'^ sm (j)dHN{t) 



> cQmoiz) / \T{t)\^dHN{t), (3.65) 



where in the last step we have used ()3.62p and the fact that sine/) = QJ^{t)/T{t). 
Therefore, by (|3.64p and ()3.65p we always have 

|1 - b{z)\ = \c{z) - b{z)\ > e9mo(z) 

with some positive constant e. Similar to the analysis in (|3.65p . when '^mQ{z) is 
sufficiently small, we also have 



\c{z)-b{z)\ < 2d-^^ 



+ 



2d-^^ I \F{t)\'^s\n^(t)dHN{t) 



\I'{t)\^ cos (I) sin (l)dH Nit) 

< CQmoiz) (3.66) 

for some positive constant C. Moreover, by (j3.60p and (j3.6ip we also have 

|1 - b{z)\ > 1 - c{z) = ^^r? ~ ,^\- (3.67) 

Thus we have for some positive constants e, e', 

|1 - ^(-2)1 > emax{Qmo(z), — -— } > €y/K + r], (3.68) 

ismoiz) 

which is implied by (ii) of Lemma [2.31 Moreover, when '^mQ(z) is sufficiently small, 
by (|3.66p we also have 

|1 - b{z)\ < |1 - c(z)| + \c{z) - b{z)\ = 0(— ^Vt) + Oi'imoiz)) = O(0^+7^iJ3.69) 

Note that the function 6{E + irj) is decreasing in t], thus we can set 

r/i = sup [t] : 6{E + irj) > (log A^)~ (k + 77)} . 
Ie 

Solving ()3.59p we obtain two solutions of mj\f{z) as mNi{z) and mN2{z) such that 



, . I . -(1 - b{z)) + V(l - b{z)Y + AajzMz) 

mNi[z) - mo{z) = K-— , 

0^(2:) 

and 



-(l-6(z))-V(l-b(z))2 + 4a(z)<5^ 

mN2[z) - mo{z) = :r— , 

a'^lz) 

Here the square root is chosen to guarantee that 



V(l-6(z))2+4a(z)5o(^) = (1 - b{z)) + O(5o(z)) 

when r/ ~ 1. 

When QmQ{z) is small enough, by (|3.68p and ()3.69p we see that there exists 



1 - b{z) ~ a/k + ry. 
Therefore, when rj <r]i, by definition we have 

K + r] < {logNy'^ 
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which imphes 

QmQ{z) ^ 1 
by Lemma 12.31 Thus for i] < iji, we have 



\1 -biz)\^./jr+Tj<C (log N)^/6iz). (3.70) 

For r/ > r/i, by the fact that 5{z) is decreasing in r] we know 

6{E + irj) < {logN)-^{K + r]i) < {log N)'^ {k + r]) < C {log N)~^ \1 - b{z)f. {3.71) 

Therefore, ([3770]) and ([37711 imply that 

\'mNi{z) - mN2{z)\ > C|l - b{z)\ > C^k^t], ri > rii 

< C{logN)y^, rj<m. (3.72) 

Observe that in H'^(z) we have 

\mo{z) - mN{z)\ < (log A^)"^ 

Thus, when r/ ~ 1, it is obvious that 

/ N / N / N 4(5o(2:) 
■mN{z) = mNi{z) = mo{z) -\ -, — 

a{z) ((1 - b{z)) + y/{l-b{z))^ + 4a{z)5o{z) 
and 

|mo(2:) — mN{z)\ < 6— — ^ < 6- 



|l-6(z)| - V^T^ 

Now by ()3.72p we know that mNi{z) ^ mj\f2{z) when rj > rji. Therefore, for rj > rji, 
by continuity we have 

mN{z) =mNi{z), 

and 

\mo{z) - mj,{z)\ < C^^^ < C ^S±^ < C- ^^'^ 



\l-b{z)\ \i-b{z)\ + y^ ^K + r] + 5{z)' 

When rj < rji, we have 

\mo{z) - mN{z)\ < |mo(2;) -mAri(z)| + |mAri(2;) -mAr2(2;)| 

<C{logN)y^<C{logN)^^^^==. 

y' K + r] + d{z) 

Therefore, we proved ()3.5ip when r] €z Ie- 

Then it remains to show that when S occurs, Ie is just exactly [ip^N~^, 1]. The 
proof is nearly the same as the counterpart in [30]. However, for the convenience of 
the reader, we reproduce it here. We assume Ie t^ [(p^N^^, 1] to get a contradiction. 
Note that if Ie ^ [(p N~^, 1], we can set ??o = inf I^; which satisfies 

Ao{E + ir?o) + Ad{E + irjo) = (log Ny\ (3.73) 

Then by the definition of T{z,K), we see in S there exists 

Ao{E + iryo) + max \Gii{E + ir]o) - mN{E + ir/o)| 
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< K'ifiE + ir/o) < Kip-^/'^ < (log iV)"^ (3.74) 

by the assumption of (p^ > K'^{logN)^. Moreover, since r/o ^ Ie, we also have 



A{E + ir]o)<C (log N)y^S{ETim) = {{log N)-'^). (3.75) 

Thus (13771) and (13:75]) together imply that 

Ao{E + ir]o) + Ad{E + ir,o) < 0{{log N)-^), 

which contradicts to ()3.73p when N is sufficiently large. Thus we complete the 
proof. n 

Now we can start to prove Theorem I3.3[ 

Proof of Theorem lS.Si Note that by the definitions of ^{z, K) and T{z, K), we have 
for any C > 0, there exist a positive constants C^ such that 

\V{mN{z))\ < \[Y]\+0{max\Gii{z)-mN{z)\^)+(X^lEiz) 

i 

< vj^<;^(z) + oo1h(2), VzG S'^(c,5C^) 
holds on the event 

n r(z,v.^c). 

Z€Sr{c,5C(;) 

Now set 

s= n r(z,^^c)n n E^{z). 

zeSr{c,5C() zeSr{c,5C(),v=l 

and 

5{z) = ip^<{Nr])-^/^ 
in Lemma 13.131 We can get that 

A{z) < f^i{NT])-^^\ Vz G Sr{c,5C(;) 
hold in S. Moreover, by Lemma 13.131 we know 

SC n ^'(^)- (3.76) 

zeSr{c,5C(;) 

Thus we have 

A < Ad < (log N)-\ ^{z) < C{Nri)-^/^, in S. 
Moreover, by the definitions of S, Lemma 13.81 and the fact that 

n -'(-) 

^eS,.(c,5Q),r;=l 

holds with (^-high probability which is implied by Lemma 13.51 and 13.6] we see that 
S holds with (^-high probability. Then by (j3.76p and Lemma 13.81 we also know 

zeSr{c,5C() 
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holds with ^-high probabihty. Therefore, by the definition of ^2(2;, c/^'-^c) we see that 
Theorem 13.31 foUows . D 



3.2. Improved bound on \Y]. In order to prove Theorem 13.21 the main thing is 
to provide a stronger bound on [Y]. More precisely, we need stronger bounds on [T] 
and vi{z). Here 

1 ^ 






Similarly, we will denote 



Recall the decomposition 



1 ^ 



N 

i=l 



4 

Yi = Ti + C/i + y = T, + C/i + J];^;^, 

k=l 

which implies that 

4 

k=l 

Note that by definitions, in T{z,K) n H'^(z), we have 

m,\v2\,\v3\,\v4\ = 0{KH^). 

Such a strong bound is enough for our purpose. Thus our main task in this subsection 
is to improve the bounds on [T] and vi. Actually, we need to prove the following 
Lemma. 

Lemma 3.14. Let K = ip^^^'^ and < L = 0(1) satisfying ip^ > K'^OogN^. 
Suppose that for some event 

ZGSr{c,L) 

we have 

A{z)<A{z)<l, VzeS,(c,L), 
where A{z) is some deterministic number. And we also have 

P(e^) <exp(-p(logA^)2) 
for some 

1 -^p '^mm{p^/^K-\logN)-\^K^~\logN)-^}, (3.77) 

where r is the parameter in Lemma \3.4\ Then there exists an event 0' C G such 
that 

¥{{&')!< I^M-P) (3.78) 
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and for any z G Sr{c,L), we have 



5 1^2 ^f,2 . x.?.^ .?. /S>mo + A 



\[T]\ + \vi\<Cp^K^I^^^ + A^j, ^ := W '; , me', (3.79) 

which implies 

\[Y]\< Cp^K^ (§2 + A^) , in G'. 



Remark 3.15. This lemma can be regarded as an extension of Lemma 7.1 of [30j 
to the non-null case. Moreover, it is also analogous to Lemma 5.2 of [20j . Corollary 
4-2 of |22] and Lemma 4-1 of [13j for Wigner matrices or the adjacency matrices of 
Erdos-Renyi graphs. 

By definitions, we have 

N 






and 



with 



1^1=^^!+ Vi 



iii := -mN{z)—^vii{z) 



i=l 

N 



vi := -— ^(Gii(z) - mN{z))vii{z). 
1=1 

Here vii{z) is defined in ()3.9p . 

Observe that by definition we know in flzeSrfcL)!-'^!'^'-^) '^ '^^i^))^ there exists 
the following bounds 

\vu\, \Gii{z) - mN{z)\ < K^(z). 

Thus in Q there exists 

\vi\ < CK^^^. 

Note that in H'^(z), we also have 

mN{z) ~ 1. 

Hence, it suffices to bound [T] and 



N 



^(^) = T^X]^!*^^)- 

Note that both [T] and uj are in the form of 

iv 
-J](x*^«x,--rr^«) (3.80) 
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with some matrix A^^^ independent of Xj but constructed from all the other Xj, j ^ i. 
If we use the notation Ej to denote the expectation with respect to Xj, we can also 
write 

1 ^ 

i=l 

and 

1 ^ 
^ = ]^ E(l - lE.)r:a«S(mW(z)S + /)-^r,. 

In the sequel, we will use the notation 

keA 

and Qm = 1 — Ej will be simply denoted by Qj. To bound summation in the form 
of (|3.80p , we will adopt the abstract decoupling lemma of [30] (see Lemma 7.3 
therein), which is a large deviation lemma for weakly hierarchically coupled random 
variables. We cite it as the following lemma. 

Lemma 3.16 (Lemma 7.3, [30]). Let Zi,...,Z]\f be random variables which are 
functions of Xij,l < i < M, 1 < j < N. Let O be an event and p an even integer, 
which may depend on N . Suppose the following assumptions hold with some constant 
Ci,ci >0. 

(i) There exist some deterministic positive numbers X < 1 and y such that for 
any set A C {1, 2, . . . , N} with i £ A and |A| < p, Q^Zi in Q can be written as the 
sum of two new random variables as 

iie){QAZi) = Zi^A + i(G)QAi(e^)z,,A 

and 

\Zi,A\ < y{CiX\A\)\^\, |Zi,A| < yN^'\^\. 

Here 1(6) represents the indicator function of Q. (ii) For Zi, we have the rough 
definite bound 

max|Zi| <yN^^. 

i 

(Hi) For Q we have 

P(e'=) < exp(-ci(logiV)^/^:p). 
Then, under the assumptions (i), (ii) and (Hi) we have 

eIn-'Y^Q^zA < {Cp)*P[X^ + N-^fyP (3.81) 

for some C > and any sufficiently large N . 

Remark 3.17. We remind here similar results also appeared in some previous 
works, such as Theorem 5.6 of [13j and Lemma 4-1 of |22j . 
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Remark 3.18. Note that once we have li°j.81\) . we can use Markov inequality to get 

P [ \N-^ Y. '3i^il > Cyp^[X^ + N'^] J « i exp(-p). 

With the aid of Leinina 13.161 we now come to prove Lemma 13.141 

Proof of Lemma \3.14\ Note that it suffices to find an event Q' £ Q satisfying (|3.78p 
such that 



5 7>^2,f,2 



|[r]| <cp'K^'^ 



(3.82) 



and 



|w| < Cp^K^ (^2 + A* ' 

in Q' . To simphfy the discussion on oj we introduce the quantities 
Vi-. = r*g«S(mo(z)S + /)-iri 

vii : = QiVi = r*g»S(mo(z)S + I)-\i - ^Tr(mo(z)S + ly^^G^^T. 

and 



N' 



UJ 



1 ^ 



i=l 



It suffices to show that in 0', there exist (j3.82p and 

\uj-Cj\<C (kA^ + K^^A , 

\u\ < Cp^K^^^. 
At first, we come to show ()3.83p . Note that by definition, we have 

Qi fr*g«S f(mi;^(z)S + I)-' - (mo(z)S + I)-' 



(3.83) 
(3.84) 



Vli -Vli 



(i), 



m%>{z) - mo{z)\ Q, (r*g'^''>^{m%>{z)^ + I)-'^{mo{z)^ + ir'r. 



N 



'2,T,2\ 



< CK{A + A^)* < C{KA^ + K-^^-^) < C{KA^ + K^^ 

Here in the first inequahty above we have used the fact that \m^{z) — 7710(2;) | < 
A + CAq and in the last step we have used the fact that K'^ <C 1 which is imphed 
by the assumption on K and L immediately. Note here we have used the fact that 
in T{z,K) n H^(z), there exists 

\Ri\ = \Qi (r*g»S(mi;)(z)S + /)-iS(mo(z)S + I)-\^ \ < K^>{z). 

Thus it remains to show ()3.82p and ()3.84p . We will adopt Lemma [3. 161 to prove both 
of these two inequlities. 

We remind here that we learn the proof strategy which will be used below from 

At first, we come to deal with the simpler one ()3.82p . Note that 

z + zv*g^^Vi = -{Gii{z)r\ 
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Thus 

Ti = Q.,{zG^^{z))-\ 
Since z £ Sr{c, L), it suffices to bound 

1 ^ 

Now we can set Zi = {Gii{z))~^ in Lemma 13.161 Then by Lemma |3 . 1 6 1 and Markov 
inequahty it suffices to show that for iGAc{l,2,..., N} and |A| < p there exists 
Zj_A and Zj,A satisfying 

i{e)iQAZi) = Zi^A + i(e)QAi(e^)Zi,A, z.^a < y{cx\A\)\^\, z,,a < yN^^^\ 

(3.85) 

with 

X = K^, y = C (3.86) 

for some positive constant C. The proof of ()3.85p is totally the same as the coun- 
terpart in the null case, see the proof of Lemma 7.4 and Lemma 7.1 of [30]. The 
proof only depends on the equations listed in Lemma 13.11 thus is also valid for our 
non-null case. So here we omit the details. Our main target is to prove ()3.84p in the 
sequel. 

By Lemma 13.161 it suffices to show that there exists an event G with probability 

P(e) > 1 - ex.p{- {log Nf/'^p) 

such that for i G A C {1, 2, . . . , N} and |A| < p there exists i>i^A and i>j_A satisfying 

i(e)(QAi>i) = j^i,A + i(e)gAi(e^)z>i,A, i>i,A<yicx\A\)\^\, i>i,A < J^^v^'^i 

(3.87) 

for some constant C > 0, where X and y are specified to be those in (|3.86|) . 

Note that by the fact r] ^ N^'^ in Sr{c, L) and the truncation on Xij, we see (|3.87p 
holds naturally when A = {i}. Thus we can always assume |A| > 2 in the sequel. 
Now let A = A{yV) be a function of W and denote A^'^^ = A{yV^'^^). Pursuing the 
idea in [13], we define 

jn,v._^ J2 (-I)IVU(V)^ 

s\ucvcs 

for any S,U C {1,2,... ,A^}. Then by definition it is not difficult to see for any 
S C {1, . . . , A^}, there exists 

A= Y. _4S,u_ 
0CUCS 

Moreover, it is not difficult to see A^'^ is independent of the /c-th column of X if 
k £ S\l]. Thus we have 

QsA = Qs Yl -4^'" = Qs^'''^ 
0CUCS 
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which imphes that 

Now we set 

It is easy to get the bound of Pj^a in (|3.87p . thus we only need to handle z^j^a below. 
That is to say, we shall show that for 2 < |A| < p, there exists 

l(G)(i>)^\«'*\« <y{CX\A\)\^\, X = K^, y = C. (3.88) 

Now we need the following two lemmas. 

Lemma 3.19. Let p be an even number satisfying 

l^p^^^/^K-^(iogN)-\ 

then in Cl^eSric L) (^(-2, K) n H^(z)), for any T with \T\ < p, there exists 

max I G,- • I < C max I Gji I , min I G,- • I > c min I Ga I , 

^'^TrG^'^^ < ^'^TrG + Cp{K^^^ + -^) 
with some positive constants C, c. 
For ease of presentation, we denote 

Mo := Mo{z) = {mo{z)J: + I)-\ 
And now we set the event 
T{z) :=P|{r*g(^u^^'J»(z)SMo(z)r* < K^iz) for all T e {1, . . . ,iV},i,i T, |T| < p} . 

Then we have the following lemma 

Lemma 3.20. Let p be an even number satisfying 

1 -^p '^mm{-K^~\logN)~^,^^/^K-\logN)-^}. 

Then in C\zeSr(cL) (^i^^^) ^ ^'^(^))> '^^ have PlzGSrCcL) '^i^) holds with probability 
larger than 

l-exp(-CK^"') 

with some positive constant C . 

In the sequel, we will assume 

Gc n (r(^,^)nH^(z)nT(z)). 

Now we prove (|3.88|) assuming Lemma 13.191 and 13.201 at first and postpone the 
proofs of these two lemmas after that. As a warm up, we start with the case of 
|A| = 2 and |A| = 3. 
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Let A = {i,j} with some j ^ i. Note 

(J>)A\{^},A\{i} = p{j},{i} = ^ _ ^(i)_ 

Note that for any set T we have the relation 

*^(Tu{M})r. - z-i '^ for i i (/T 

Thus we have 

< 0{X^) in e. (3.89) 

If |A| = 3, without loss of generality, we assume that A = {i, j, /c}, it is easy to see 

= {G,,-Gf){Gur'r';g'^''^^Mor, 

+G\f{{Gu)-' - (G;fV)r*a(*-'')SMor, 
+G[f{G['^^)-Hr*g^'^^^Mori - rp^'^'^^^MoTi). (3.90) 

Note that for any set T such that i,j, /c T, there exist 

^(T) _ ^(TU{A:}) _ ^(T).^(T)._i^(T) 
'-^ij '^ij — '^ik y'^kk ) ^kj ' 

and 

r*g(Tu{^,,})sMor, - r*g(^u^^'^>»SMor, = G5^^'»(G5.;^^^'»)-i4g(^u^^'^>»EMor,. 

(3.92) 
Thus we have 

(^^)A\«,A\w = r*g('^)^Mor,{Gu)~'G,k{Gkk)-'Gk, 

-T*g^'^^j:Mor,{Gur'G,k{Gkkr'GM{Gl^Y'Gf 
+vig^^^T.M^,(G^)-^dif(G^r^G^ 
< 0(^-3), in e. 

Now we use the idea in |T3j (see Section 5.2 therein) to introduce a class of rational 
functions in resolvent matrix elements. For fixed positive integer n we define 
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(i) a sequence of integers (ir)"=i satisfying i^ 7^ i^+i for 1 < A; < n — 1 while 
h = in+i', 

(ii) a collection of sets {Ua)a=i satisfying ii,i2 G Ui and ia,ia+i l]a,o: > 2, 
and C Uq, C A for all 1 < a < n; 

(iii) a collection of sets (T^)o^2 satisfying i^ T^, and C T^ C A for all 
2 < /? < n. 

Then we define the random rational function 

D {{Q'^+l (U„)S=i, (¥^3)^=2) := ^, (3.93) 



where 

n n 



a=2 



Note that in the cases of 1^41 = 2 and 1^41 = 3, we can write (i>j)'^\i*j''Mln as a 
summation of rational functions in the form of 

±I)((v):!;^(U«)S=i,(T;3)^=2)- (3.94) 

Actually, for general S we have the following lemma. 

Lemma 3.21. Let S C {1, . . . , A^} and i S, then we have 

2|S| X„ 

n=|S| + l fc=l 

where 

2S 

Y^ K„<4l^l|S|!, 

n=|S|+l 

and each D^^k is in the form of 13.94\) , with appropriate chosen sets {J]a)'%=i o-nd 
(T^)o^2 which may be different for each Fn,k- 

Proof. Set 

A = Su{i}. 

We prove it by induction. Note that we already proved the cases of |A| = 2,3. At 
first, it is not difficult to check the following relation, 

i}A\{i} - rr,.\A\{i,i},A\{i,i} _ ((r,.\^\{h3}A\{i,3}^^^^ 



for any j ^ i while j G A. Now we assume that (i>j)*\i*Jj''*'\i*Ji can be written in 
the form of 

2|A\{i,j}l A-n 

(^^)A\{.,,},A\{M} = £ D„,, D^ = YDn,k 

n=|A\{j,j}| + l fc=l 
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with some Dn^k in the form of (|3.94|) and K„, satisfying 

2|A\{ij}| 

^ i^„<4|A\{M}||A\{i,j}|!. 

n=\A\{i,j}\ + l 

Then 

2|A\{i,i}| 2|A\{i,i}| K„ 

(,^)A\«,A\«^ ^ ^D^_Di^))= Y. E(D 

n=\A\{t,j}\ + l n=|A\{ij}| + lA:=l 

Now we assume that there are (ir)"=i , {^a)a=i, {^13)^^2 such that 

?i n _, 



n,k ^n,kr 



a=2 



+1 ±x \ '/s.^/S 

2 



Then 



^n,. - dI^I 



™ ™ _1 



a=2 /3=2 

n 



,il3j 



-rigi^Mmj:M,r., n G^f n (g: 



(U.U{j}) T-T /^(T^U{j})\-l 



a=2 



13=2 



10,^13 



Now obviously Dn^k ~ ^nk *"^^ ^^ written as a summation of (2n — 1) terms, and 
every term contains one and only one factor in one of the following form 



^(Uc,) 



Q{^c.yj{j}) 



^a ?^q: + 1 ia ?^q: + 1 






-1 



^(T/3U{i}) 



-1 



Now using (j3.90p and (j3.9ip we see that Dn^k — ^nk ^^ ^ summation of terms in the 
form of (|3.93|) with n replaced by n + 1 or n + 2. Moreover, the total number of 

these terms in Dn^k — ^nk ^^ "^^ niore than 2n — 1. These facts together imply that 
we can write 

2|A\{i}| K'^ 
n=|A\{J}|+lfc=l 



and 



2|A\{i}| A-; 




E E 


< 


n=|A\{J}|+lA:=l 





2|A\{ij}| 

Y, Kni2n - 1) 

"=|A\{i,i}l+l 

< 4l^\«l|A\{i}|! 



Thus we can complete the proof by induction. 



n 
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Now by the definition of Dn , Lemma 13.191 and 13.201 we can find an event 
GC fl {Tiz,K)nE%z)nT{z)) 

Z(iSr{c,L) 

with probabiHty 

P(G) > 1 - exp(-p(log 7V)^/2) 
such that ()3.88p holds, thus we conclude the proof of Lemma [3. 141 

Now we come to prove Lemma 13.191 and 13.201 
Proof of Lemma \3.19[ Note that by (iii) of Lemma 13. H we have 



D 



iGuV' = {G^r' - -^^^ = (1 + o{x')){G'i^r' 






(j) 



and 



ig(')|< 



Gi 



GikGkj 



Gkk 



<K{l + 0{X)), i^j, 



i]\ 



(3.95) 



which implies 

minlGjf I > (l-0(A'))min|Gii|, maxicjf | < (1 + O(A')) maj 

i^k hjy^k •' i,j 

Thus we can get the first two inequalities of Lemma 13.191 by induction and the 
assumption that pX <^ 1. 

Now we come to show the third inequality of Lemma 13.191 Note that for z G 

Sr{c,0) 



— '^Trg^'^^ - —QTrg 
N N 



< 



^r^TrG^^) - —QTrG 



N N 

- AT Z^ I ** **l ' ^ JV 



.0,^, 



J^T 



N' 



Now by ()3.95p and (iii) of Lemma 13.11 it is not difficult to see that 



dp-Giil <pK'^ 



2iTf2 



Therefore, we conclude the proof. 



n 



Proof of Lemma V3.2(A Similarly, it suffices to prove the result for any fixed z £ 
Sr{c,L). By using Lemma 13.41 and (iv) of Lemma 12.31 we have that 



r*g('^'Jii'^})(z)J:Mo{z)r* < K h\g^'^'^^''^^\z) 

■I j\j 

holds with probability larger than 

l-iV^exp(-C"i^^'') 



\HS 



Ks 



'9rrg(Tru{i,i})(^) 



N'^T] 
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with some positive constants C,C'. Now by Lemma 13.191 and the assumption on p 
one has 



r*g(Tu{^,i})(^)SMo(z)r* < kJ^^^^ + ^^{z) < CK^{z), in r{z,K) D H'=(z). 

Now note 

KT C {1, . . . , TV} : |T| < p} = OiNP+^). 
Thus in T{z,K) D H'^(z), T holds with probabihty larger than 

by the assumption on p. Thus we conclude the proof. D 

3.3. Strong local MP-type law around A,.. Now we come to prove Theorem 
13.21 The proof is rely on a bootstrap strategy we have mentioned in Introduction 
and is similar to that of the null case in [30j. The main difference is that the 
iterate rate in the bootstrap process for our non-null case is slower than that of 
the null case in [3D]. Roughly speaking, we will show that if for some exponent 
r, A(z) < {Nri)~'^ holds module some logarithmic factors with high probability, 

there also exists A(z) < (Nrj) i module some logarithmic factors with high 
probability. 

Proof of Theorem \3.B . We assume C ^ 1 ■ By Theorem 13.31 Lemma 13.81 and Lemma 
13.201 we know that for any C > 0? there exists a positive constant C(^ such that 

Gic n {nz,v^^)nE'{z)nT{z)) 

holds with (C + 2(log 1.3)^^)-high probability. In the sequel, we assume 

Cf > T(10C + 20(logl.3)"^), (3.96) 

where r > 1 is the parameter in Lemma |3.4[ By Lemma 13.121 we know 

\V{mN)iz)\ < C(^2^c^2 + \[Y]\, in Gi. 
Now let Ai = 1, thus A < Ai in Bi. Set 

p = pi = -log(l-P(ei))/(logiV)2 
in Lemma 13.141 Thus we have 

p, = C(^C+2(logl.3)-/^l^g^^2_ 

1 T -I n/O 



Recall the parameters K, L used in Lemma 13.141 here we have L = lOCc, K = tp^'- 



Thus it is easy to check ([STTl) by (l3:96]) . 

By Lemma 13.141 we know for z G Sr{c, WC,^) there exists an event 02 such that 

GaCOi, P(G2) = 1 - exp(-pi) 
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and 



|[y]|<^2Cc+5C+io(iogi.3)-(^2^^^^^)^ M.1 := J^!I5^±Ai, inG2. 
Then in 02, we have 

\V{mN){z)\ < 992Q+5C+10(logl.3)-i(^2 ^^^^^)^ (397) 

Note that the r.h.s. of (|3.97|) is decreasing in ry and by (j3.96|) it is less than (log N)^^. 
Then by using Lemma 13.131 we have 

A(.) < A,{z) := ^2C,+6C+io(iogi.3)- (" ^1 + ^^l^ ] . (3.98) 

Now note by using the fact that 

9mo(z) < 0(a/kT^), in Sr{c, WCq) 
one can easily get that 



*? 



Moreover, in 02 we have 



< 



CAl^\Nr])-^^^. (3.99) 






Vk + ?7 + Ai*i - (K + ,y + Ai^i)i/4 (Ai^i)V4 

Now note that we have 

11 . ^ 

< ^= , (3.101) 



,,,, . , , Jl!^<c({Nr])-^+Ay\Nr]) 



Nri 

and 



Nri' 



Inserting ()3.10ip and (|3.102p into (|3.100p we have 
Ai^i 



(3.102) 



= <C (Ai/^(iV77)~3/8 + Ai/^(iVj?)-i/4) . (3.103) 



Combining (j3.99p and p.l03p we have for z G ^^(c, lOC^), there exists 
A < A2 < ^2Cc+6C+io{iogi.3)-i (^A^ "(iV^)-i/4) « 1. 

Then iterate this process for Kq := log log A^/ log 1.3 times, we have 

&Ko C Qko-1, ^i^Ko) = 1 - exp{-pKo) 
with 

PKo = -log[l -P(0i,o_l)]/(logAr)2 = C(^C+2(logl.3)-l(iQg^)-2i^o > <^C 
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and in Qkq one has 

A{z)<Ak,{z) := ^2Cc+6C+10(logl.3)-l^V4_^(^^)-l/4 
< j^2Cc+6C+10(log 1.3)-i (Ar7y)-l+(3/4)'^0 

where in the last step we have used the fact 

which can easily verified by the definition of Kq. Thus we complete the proof of 
(|3.1U|) by adjusting the constant C(^ in Theorem 13.21 Moreover, by Qkq C 0i and 
the definition of ©i we obtain (|3.1ip . D 

4. Universality for the largest eigenvalue 

In this section, we will prove the universality for the largest eigenvalue. Actually, 
with the aid of the local MP type law, the proof of the edge universality can be 
obtained through a routine discussion as that in the null case. The proof process is 
analogous to that of [30j. As we have sketched in Introduction, the Green function 
comparison strategy was developed in the series works [201 EU [22l [H] for generalized 
Wigner matrices and Erdos-Renyi graph, and extended to sample covariance matri- 
ces in the null case in [30J . One main novelty in pO] is that the authors replace once 
a column instead of once an entry in the swapping process. Such an idea allows one 
to extend the Green function comparison approach to prove the edge universality 
even for some random matrix model with weakly dependent column entries such as 
the correlation matrices. One can refer to [7j and [31j . In our non-null case, we will 
also adopt this idea. To this end, at first we need to translate the information on 
mN{z) to the eigenvalues around A^. More precisely, we will prove the rigidity of 
the locations of the eigenvalues on the right edge. Then rely on the rigidity of the 
eigenvalues, we can pursue a Green function comparison strategy to prove the edge 
universality. 

One main difficulty in this part is that we need to provide some bounds for the 
entries of S^'^^S^'^ and S^'^^^S^'^(see Lemma 14.81 below), which is a necessary 
input for the Green function comparison process. When E is diagonal, these desired 
bounds can be obtained through a delocalization result on the eigenvectors of Wn- 
However, for more general S, the issue will be much more complicated. In more 
general case, we turn to seek for a rough bound on ^{T}-''^QT}''^)kk, /c = 1, . . . , M. 
It will be seen that such a rough bound is enough for the Green function comparison 
strategy assuming the two matrices are matching to order 4. Thus the main novelties 
in this part are one delocalization estimate for the eigenvectors of Wat in the edge 
case for diagonal S and a rough bound on 3'(S^'^^S^' ^)fcfc, A; = 1, . . . , M for general 
E. Especially, for the later one, our main idea is to provide a rough bound for the 
diagonal entries oiT?-i'^{yV — zq)~^T}i'^ with zq = E + ir]Q , where rjQ = A^~2/3+e ^ ^^ 
Then by this rough bound we can obtain a rough bound on (S^/^UjU*^^/^)^^, A; = 
1, . . . , M, where Uj is the unit eigenvector of W corresponding to Aj which is around 
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Xr. Then the bounds for (S^/^UjU*!;-'^'^)^^ can turn to help us to provide a rough 
bound on 9(Si/2gsV2);,^. 

Now we start to expand the strategy summarized above in details. At first, we 
need the following rigidity result on the right edge. 

Theorem 4.1 (Rigidity of the eigenvalues around Xr). Under Condition \1.1\ and 
/ [g.^l) . for any C > there exists a constant C^ such that the following events hold 
with Q-high probability. 

(i): For the largest eigenvalue Ai(W), there exists 

|Ai(W)-A^| < A^-2/V^«. 

(ii): For any 

Ei,E2 G [Xr — C,Cr], 
there exists 

\{Fn{Ei) - Fn{E2)) - {Fo{E,) - Fo{E2))\ < ^(^"g^)'^''^ . (4.1) 

Proof. Relying on the strong local MP-type law, the proof is analogous to the coun- 
terparts in [22J and [30j. Note that by Lemma |2.H it is easy to see 



l-Fo(A.-</^^«iV-2/3) 



yjS'-C 



N 
Then if (ii) of Theorem 14.11 holds, we have 

Xi{W) > A^-iV-2/3^Q^ 

Thus it suffices to prove that (ii) and 

Ai (W) <Xr + N-'^/^ip^i (4.2) 

hold with ("-high probability. 

At first we come to verify ()4.2p . We recall the fact that 

Cl<Xi{W)<Cr 

holds with (^-high probability. 

Therefore, to show ()4.2p . it suffices to prove for any ( > 0, there exists some 
C^ > such that 

max{Aj(W) : Xj{W) < CJ < A^ + iV^^/s^Cc (43) 

with (-high probability. To this end, we recall the iteration process in ()3.97p and 
()3.98p . In the sequel, we set C,^ = 5D(^. By (i) of Theorem [321 we know that with 
(■-high probability. 



for some constant D^, C > 0. Thus we have 

A(.)<C(logAr),,-c^.^, 5:=?!^^. 
Vk + r? + d Nr] 
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Now for any E > Xr + N "^'^ip^^i, we choose 

Note here 

It is easy to check the following relations, 



'^^^'''V, ]^»1- (4-4) 



Note that by using (ii) of Lemma 12.31 we have 

Qtuq^z) = C 

thus by the last inequality of ()4.4p we have 



and thus 



Then we can get 



9mo(z) <. —- (4.5) 

Nr] 



^^A^ + (^^)' 



A(z) < Cip^i ( ^ + ^7^1 -r^ « ^. (4.6) 

^ ^ ~ ^ \K Nr]y/Kj Nt] Nr] ^ ^ 



where we have used the fact that 

Then (|4.5p and (|4.6p together imply 

QniNiz) < — -. 
jyr] 

Now by the basic relation 

N{E -r],E + r])< CN'q'=smN{z) < 1 

we know there is no eigenvalue in [E — r],E+ri], thus ()4.3p is verified, so ()4.2p follows. 

For (ii) , with the aid of Theorem l3.2l the proof of ()4.ip is just the same as the proof 
of (8.6) of [30j. The main strategy is to translate the closeness between rriN and mo 
to that of the spectral distributions (F/v and Fq). Such a strategy is independent of 
the matrix model. Actually, we only need to set the interval [j4i,742] = [A^ — c,Cr] 
in Lemma 8.1 of [30j and the remaining part of proof is totally the same as the 
counterpart in [30j. Thus here we do not reproduce the details. 

Therefore, we complete the proof. D 

In the sequel we will pursue the Green function comparison approach used in 
[50] to prove the universality of the largest eigenvalue. The main idea is to use 
a functional of Green function to approximate the eigenvalue statistics, and then 
use a Linderberg strategy to prove the limiting behavior of this functional of Green 
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function is universal. Therefore, the first step is to represent the distribution of the 
largest eigenvalue of Wjsf in terms of Green function Gm{z) in an approximate sense. 
Note that by using the square root behavior of Pq{x) in Lemma 12.11 again, it is 
easy to see for any positive constant C^ there is 

-C 

when A^ is sufficiently large. Together with ()4.ip we immediately get that 

Fjv(Ar + 2c^^ciV-2/3) - FN{\r - 299^ciV-2/3) < I__ (4.7) 

with (^-high probability. Moreover by Theorem 14.11 we also have 

|Ai(W) - A^l < (^^ciV-2/3 (4.8) 

with C-high probability. Thus to show Theorem 11.31 it suffices to assume that 

-ip^i < s <ip'^(. 
Now set 

E^ := Xr + 2ip^i N~^/^ . 
For any E < E(^ we denote 

XE ■= Me,E(;]- 
For ease of presentation, we denote 

Ur][X) := — ^ — = / 2 I 2\' 

TT X — IT] 7r(X^ + 7?^j 

and the number of eigenvalues of Wm in an interval [Ei, E2] by 

M{Ei,E2) := N{Fm{E2)-Fm{Ei)). 
Note by definition we have 

M{E,E^) = TrxE{H). 
Observe that 

1 r^c 



TrxE-i * 0^{Wn) = N- / 9mjv(y + iv)dy, 

71" Je-i 

which can be viewed as a smoothed version of the counting function TrxE{H) on 
scale rj. An obvious advantage of TrxE-i * Or/iW^) is that it can be represented in 
terms of the Stieltjes transform. The following lemma claim that we do can replace 
TrxE{H) by its smoothed approximation Ttxe-i * ^r?(W^7v)- 

Lemma 4.2. Let ?yi = Ar-2/3-9£ ^^^ ^^ _ ^-2/3-3£ j^^ ^^y ^ > q. jj j^ satisfies 

\E-Xr\ < -v9^«A^-2/^ 

we have 

\TrxE{H) - TrxE * 0^, {H)\<C (A^-^. + _^(^ -h,E + h)) 
holds with C,-high probability. 
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Proof. The proof is similar to that of Lemma 4.1 of [30J. Thus we will just sketch it 
below. By the assumption on E, (j4.7|) and (|4.8p . it is not difficult to check that we 
can use (4.9) of |30j, that is 

\TrxE{H)-XE*0vAH)\ < C{M{E - h,E + h) + N'''') 



+ CNr,i{E^ -E) I —^^mN{E -y + 



ih)dy. 



By the bounds on Ai(>V) in (|2.2p . we know 

1 



L 



r.'^mNiE -y + ili)dy = 0{1) 

y>Cr y^ + H 



IE- 

with ^-high probability. Now we further split the interval [— oo, Cr] into [— oo, A^ — c) 
and [\r — c, Cr] ■ When E — y ^ W — c, Cr] , we have 

\QmN{E -y + ih) - Qmo{E -y + ili)\ < -r-rr- 

l\ li 

with ("-high probability. When E — y £ [— oo, A^ — c), by assumption on E we have 
y > c/2. Thus we have 

/ ^ '^mN{E -y + ili)dy = / QmN{E - y + ili)dy + 0(1) 

J-oo<E-y<Cr y + h J \r-C<E-y<Cr 

Moreover, it is not difficult to get 

/ „ '^mN{E -y + ili)dy 

J Xr-c<E-y<Cr y + h 

f 1 / s f^^ f 1 

JXr-c<E-y<Cr y + H J^h J\r-c<E-y<Cr y + K 

with C-high probability. Then by the assumption on E and the definitions of rji and 
/i, it is not difficult to obtain 

iV77i(^C -E) I -:T^^mN{E -y + ih)dy < N'^' 
Jmy + 'i 

through elementary calculation. Thus we conclude the proof. D 

With the aid of Lemma|121 it is not difficult to see Corollary 4.2 of [30] also holds 
for our non-null case. We cite it below as the following lemma. 

Lemma 4.3. Let rji = N^'^'^^^'^,1 = ^N^'^'^^'^ and h{x) be a smooth cut-off func- 
tion satisfying 

h{x) = l, if |x| < 1/9, ,h{x)=0 if |x|>2/9, h'{x)<0, if x>0. 

Then we have 

Eh{TrxE-i*e^,{WN))<F{XiiWN)<E)<EhiTrxE+i*er,dWN)) + 0{eM-V^')) 
when N is sufficiently large. 
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Lemma 14.31 allows us to represent the distribution of Ai(Wiv) in terms of the 
expectation of a functional of the Stieltjes transform in an approximate sense. Ac- 
tually, by the discussions in [3D] (see the proof of Theorem 1.1 therein), to prove 
Theorem 11.31 it suffices to prove the following Green function comparison theorem. 

Theorem 4.4 (Green function comparison theorem around A^). Let W" and W^ 
he two sample covariance matrices in Theorem, \1.S\ Let f he a real function with 
derivatives satisfying 

I f'^^MxM 

with some positive constant C . Let e he some sufficiently small positive constant and 
let E, El and £'2 be any real numhers satisfying 

I E — Aj. I , I E\ — Aj. I , I E2 — Aj- 1 !^ A* ' 

Moreover, we set rj = N^'^'^^^ . If either A or 'Q in Theorem M.^ holds we have for 

z = E + irj, 

|E/(iV??9mXr) - E/(iVr/9mXf)| < 7V"^'^ (4.9) 

and 

/ (■E2 \ / J-E2 

EfiN 9mXf(x + ir/)dxj -E/(iV / 'im,]^{x + ir])dx 

with some positive constant C when N is sufficiently large. 



T-C'e 



< N-^ " (4.10) 



Remark 4.5. Similar to Theorem 4-4 of [30J, we actually need an extension of 
[J^TW to verify Remark \1.4\ and Remark \l.b\ That is , for hounded smooth function 
(7 : M'^' — )• M with hounded derivatives, and real numbers Ef^ < ... < Ei < Eq 
satisfying \Ei — Xr\ < N^'^''^^'^ ,i = 0, . . . ,k and rj = N~'^'^~^ with some sufficiently 
small constant e we have 

/ rEo f-Eo 

IKg I N '^mff{x + ir])dx, . . . ,N / 3'm]^(x + iri)dx 

V Jei JEk 

(rEo rEo 

N Qm]^{x + ir])dx,...,N Qmjf{x + iri)dx ] < iV-^"f4.11) 

Jei J Ek J 

with some positive constant C when N is sufficiently large. The proof of ([JTTP ^'^ 
similar to that of ^J^W) thus we leave it to the reader. 

Now we are at the stage to prove our main results. 

Proof of Theorem \1.3[ Given Lemma 14.31 and Theorem 14. 4| the proof of Theorem 
11.31 is the same as that for the null case. We refer to the proof of Theorem 1.1 of 
|30J for details. D 



T-C'e 



Proof of Theorem \1.5[ Theorem 11.51 follows from Theorem 11.31 Theorem 1 of [10] 
and Proposition 2 of [28] immediately. D 

It remains to prove Theorem 14.41 in the sequel. 
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Proof of Theore'm \4.4\ We will only verify (|4.9p below, the proof of (j4.1Up can be 
taken similarly. Let 7 S {1, 2, . . . , A^ + 1} and set X^ to be the matrix whose first 
7 — 1 columns are the same as those of X^ and the remaining A'^ — 7 + 1 columns are 
the same as those of X". Thus we have Xi = X^ and Xjv+i = X"^. Correspondingly, 
we set 

Wn,j = X*i:X^, Wn,^ = S^/2j^^j^*5]i/2^ 
To verify ()4.9p . it suffices to check that for every 7 the following estimate holds 

EF {riQTr{WN,^ - z)-^) - EF (??9rr(Wiv,7+i - z)'^) = 0{N-^~^''). 
Note that X^ and X^^i only differ in one column, thus 



which implies 



7 ~ ^7+1' 



w(7) _ TT/(7) yw(7) _ va;(7) 



Therefore, we can write 

EF {r]'^Tr{WN,y - z)'^) - EF {r]'^Tr{WN,'y+i - z)'^) 



EF (r/9Tr(W^7V,7 - z)'^) - EF [ri'^[Tr{W^ 



Af,7 



Z) — Z 



- (eF {rjQTr{WN,^+i - z)-') - EF (r]'^[Tr{W^^}^^, - z)-' - z-^])) 

(4.12) 

Set 

Mk{i) = {(E(K\/iVxji)'(9\/iVxji)™, j, l,m) :j = l,...,M,m + l<k} 

be the set of first k moments of the entries of Xj indexed by j, I, m. By ()4.12p it 
suffices to show for any sample covariance matrix Wn satisfying Condition II. H the 
following two lemmas hold. 

Lemma 4.6. For any random matrix Wn satisfying Condition ] 1.1{ for z = E + irj 
if there exists 

\E - Xr\ < iV-2/3+£^ iV-2/3-s < r/ < iV-2/3, 

then we have 

EF{Nr]QmN{z)) - EF(iV7?9[mS^^(z) - (iVz)"^]) = A{X^'\M2{i)) + iV"^"^i;4-13) 

when Ti is diagonal and 

EF{Nr]'^mN(z)) -EF{NT]'^[m'--}(z) - (iVz)"^]) = B{x'-'\Mi{i)) + N-'^-^lA.U) 

for general S, where A{X^^\ M.2{i)) is afunctional depending on the distribution of 
X^*) and A42{i) only and similarly B[X^^' ,A44{i)) is a functional depending on the 
distribution of X^^' and Aii{i) only. 

Now we denote the set M.k{i) for X" and X^ by A^"(i) and M.J,{i) respectively. 
Then we also have the following lemma. 
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Lemma 4.7. Under Condition \1.1\ i/W" matches W" to order 2, we have 

max\A{Xl^^\M^{-f))-A{X^^\M^{-f))\ = 0(e-^«(i°g^)'') (4.15) 

with some positive constant cq and C > 1. IfW^ matches W" to order A, we have 
max|S(X^-^),A^^(7)) - B{X^^\MI{^))\ = 0(e-^°('°s^)''). (4.16) 

Here A and B are the Junctionals in Lemma \4-6[ 

It is obvious that (j4.9p follows from Lemma 14.61 and Lemma 14.71 The proof 
of (j4.10p can be taken analogously. Thus we conclude the proof of Theorem 14.41 
assuming the validity of Lemma 14.61 and Lemma 14. 7[ D 



Proof of Lemma \4.6[ Without loss of generality, we just check the result for i = 1. 
By the discussion in [30j(see (4.24)-(4.30) therein) we have with ^-high probability 
the following estimate holds, 



F{Nr]'^mN{z)) - F{Nr]Q[m^^\z) - (Nz)-^]) 
3 , 



k\ 
fc=i 



where 



y = r?zGiirt(g«)2ri. (4.18) 

Here we have used the fact that 

NmN{z) - N[ni^^\z) - {N z)-^] = TrG - TrG^^^ + z^^ 

= TrQ - TrQ^^^ = -zGurlig^^^fn. 
Moreover, note that by 

|ri(^(^))'ri - lrr(g«)2s| < ^^1^^^^^ 

with (^-high probability, we have for z satisfying the assumption in Lemma [4.6t there 
exists 

I n^ ; ii - ^ 1^ I V- ^^ _ K ) 

with (^-high probability. Thus (j4.17p holds with ("-high probability. 
Now let 

_ mp - Gil 
Gil ' 

then with ("-high probability we can write 

00 
Gn = ^^=moY,i-D)'^ (4.20) 

fc=0 
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since with ("-high probability |-D| < A^ i/3+2e -^j-^ich is implied by (ii) of Theorem 
Inserting (|4.2U|) into (j4.18p we can write 



k=l 

By (|4.19|) . we know 

\y\ < 0{N~^/^+^'). 
Moreover, by the bound on D, we can easily get 

Then it is not difficult to find that 

EF(iVr?9mAr(z)) - F{Nr]Q[m''^\z) - (Nz)-'^]) 

+EF(^\Nrj'^[m^i\z) - (iVz)-i])(i(%i)2 + %i%2) 

+EF(3)(iVr/9[m£^(z) - {Nz)-^]){-Qyif + 0{N-^/^+^^) (4.21) 

6 

Now we need the following lemma. 

Lemma 4.8. AssuTne that z = E + irj satisfies 

\E - \r\ < iV-2/3+£, Ar-2/3-e < ^ ^ iy-^/S. 

Then when S is diagonal, we have 

\{Q^^\z)\,\ < N""' , and \{[G'^^\z)]%j\ < N^'^+''' (4.22) 

with Q-high probability. 

For general S under (Hi) of Condition \1.1\ we have 

|9(Sl/2gW(z)S^/2)^^| ^j^l/3+Ce (4^23) 

with Q-high probability. 

Now we prove (|4.13|) and (|4.14|) assuming Lemma [4. 81 and prove Lemma [4.81 later. 
At first, we come to verify (|4.13p when S is diagonal. Recall (j4.2ip . we start from 
the third term on the r.h.s of (j4.2ip . Denote 

w = Qr]zmQ, g = ^firjzmQ. 

Thus we have 

%i = zui^rlig^^yfri) + £,(9rKa«)'ri) 

For ease of presentation, we denote the real part and imaginary part of a complex 
number A by ^[0] and A[l] respectively. Then it is easy to see that Ei(9yi)^ is a 
summation of some terms in the form of 

l{a+b=3}r^''Q' Y. tl{^'^\G^'^f^'^\..^.k,M]Eflx,^m, (4.24) 

ki,--- ,kQm=l 1=1 
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where ai,/3j are or 1 and a,b are nonnegative integers. Thus it suffices to deal 
with quantities in the form of (|4.24|) below. 

Now we classify the terms in the summation (|4.24|) by {fci, • • • , feg}. If there is a 
kj appears only once in {ki, ■ ■ ■ , /oe}, then such term is zero obviously. If every kj 
appears exactly twice, then such term only depend on X^'^' and A^2(l)- Therefore, 
it remains to consider the terms that there is at least one kj appears at least three 
times and no kj appears only once. It is obviously that the total number of such 
terms is 0{N'^). Now by using (|4.22p in Lemma |4.8[ we see the total contribution 
of these terms is less than 

CN~^\7]zmofmax\{^^/^{g^^^f^^/%jf < CN-^\rjzmof ma^\{g'-^^f)ijf = 0{N-^+^') 

ij ij 

with (^-high probability. Therefore, we have 

E(%i)3 = Ai{X^^\M2{l)) + 0{N~^+^') (4.25) 

for some functional Ai depending on the distribution of X^^' and A^2(l) only. 
Analogously, by invoking ()4.22p we can get 

E ((%i)2 + 2(%i)(%2)) = ^2(X«,A^2(1)) + 0(7V-5/3+c.) (4^26) 

and 

E(%i + %2 + 92/3) = A3{X^'\M2{1)) + 0{N-^/^+'^') (4.27) 

with some functionals A2 and A3 depending on X^^' and A^2(l) only. Inserting 
()4.25p - ()4.27p into ()4.2ip we obtain the conclusion when S is diagonal. 

Now we come to prove (|4.16p for more general S. Note that for Ei(3'yi)^, by 
()4.24p . it suffices to bound the terms in which all ki,i = 1, . . . , 6 are the same, since 
all the other terms only depend on the distribution of X^^' and A^4(l). In other 
words, we need to bound the terms in which E|xjj|^ appears. It is not difficult to see 
that the total contribution of such terms in Ei(9yi)^, Ei ((Qyi)^ + 2{Qyi){Qy2)) 
and E(9'yi + 9=y2 + Qys) can be bounded by 

CiV-3j;r?3|(sV2(g(i))2sV2)^,^|3 

k 

k 
+ CiV-='5;^|(SV2(g(l))SV2)^^|2|(sl/2(g(l))2sl/2)^^| (4.28) 

k 

with some positive constant C. Now note the elementary relation 

|(sV2(g(i))2sV2)^^| <^-i|c^(sV2(g(i))sV2)^^| <^-i|(sV2g(i)si/2),,|. (4.29) 

Thus it suffices to bound the last term of (|4.28p . Moreover, by the ffist inequality 
of KlU\i and (105I1 we see 



\i^y\gWf^y\,\<N 



l+Ce 
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Now we come to estimate the last term of (|4.28|) . 

k k 

< iV"8/3+C-^^|(SV2^«SV2)^^.|2 < Ar-8/3+C.pl/2^(l)5.(^(l))*5.1/2j^^ 

k j 



kk\ 



< C7'Ar-8/3+c^(E^/2g(i)(g(i))*5.i/2^ 
Thus we complete the proof of ()4.14p . 
Now we come to verify Lemma |4.7[ 



kk 



^/^-8/3+C.^-lc^(5.1/2^(l)5.1 



/2' 



kk = 0{N 
D 



-5/3+Ce\ 



Proof of Lemma |^.?| Note by the discussion in the proof of Lemma 14.61 it is easy 
to see (1^5]) and (fiJ6]l hold by Definition O and Lemma \TM □ 



At the end, we come to prove Lemma 14.81 

Proof of Lemm,a \4.8[ Since Q^^' and Q are only different in dimension, we will just 
work on Q for ease of presentation. We will prove ()4.22p and (|4.23p separately. 

Proof of IJi4-22 ). Note when S is diagonal, we can denote 

S = diag((Ti,...,cr|^). 

Let E'-'J be the (M — 1) x (M — 1) minor of S obtained by deleting the j'-th column 
and row of S. Moreover, we denote the j-th row of X by ^* and the (M — 1) x A^ 
submatrix obtained by deleting X* from X by X^^'. Correspondingly, we will use 
the notation 

gy] = (wj^^ - z)-\ Gl^'l = (VfI^'I - zr\ 
Then 

/ ^i-^r \ 

cr2'^2 
Wat = . {aiXi,a2X2,- ■ ■ ,aMXM) 

alXlXx (JiA'i*(AW)*(SW)1/2 

cti(sW)1/2xWa'i (sW)1/2xW(aW)*(sW)1/2 

Then by Schur complement we have 

1 1 



Qx 



-z - za\Xl ((AW)*SWxW - z)'^ Xi 
Now observe that 



-z-zalX^G^^^Xi 



X;G^^^Xi - —TrG^^^ 
^ N 



Pc 



< ip'-'i 



I^TrGi^^ 
A2?7 
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holds with (^-high probabihty. Now note that we have 

TrG^^^ = Trg^^^ - ^~^ + ^ = TrQ - ^~^ + ^ + 0{r^-') = TvG + 0{r'). 

Therefore, we have for z G 5r.(c, 5C^) 

XlG^^^Xi = mN{z) + o(l) = mo(z) + o(l) 
with (^-high probabihty, which imphes 

^11 ^ 2 TT"; — 77\- 

—z — zafmo[z) + o{l) 

Analogously, we have 

-z - zafmo{z) + o{l) 

with (^-high probability. Then by (iv) of Lemma 12.31 we obtain for z G S'j.(c, 5C^), 

\gii(z)\ = 0(1) (4.30) 

with ^-high probability, thus we get the bounds for Qa when E is diagonal. Now we 
use ()4.30p to derive a delocalization result on the eigenvectors of Wat in the edge 
case. Then we use the delocalization result to bound Qij and {Q)fj. 

We denote the unit eigenvector of Wat corresponding to Xk{W) by u^ = (u^i, ■ ■ ■ , uj^m) 
We have the following lemma. 

Lemma 4.9. When S is diagonal, if XkiyV) £ [Ar — c/2,Cr], we have 

max|MfcjP < V^'^'^TT 
i i\ 

with Q-high probability. 

Proof of Lemma \4-!^ By (j4.30p and spectral decomposition, we have 

M 

^^^^ = g(A,(W)-V+.^'""'' = ''^'^ 

Now we set rj = ip'^''^N~^. Since (j4.30p holds uniformly for all z £ Sr{c, 5C(^), we set 
E = Afc(>V) for some Afc(>V) G [A^ - c/2, Cr], then 

"^ \uk^\^ = ip-^'^N\uk^\^ = Oil), 



(Afc(W)-^)2 + r?2' 
which implies Lemma 14.91 immediatelv. Thus we conclude the proof. D 

Now we proceed to prove ()4.22p . Note that by spectral decomposition, for z = 
E + irj satisfying the assumption in Lemma 14.81 and a = 1, 2 we have 

M ^ ^ 

\{G'"{z))ij\ < Y, \x (yv) _ zlJ ^'kiWukjl < Yl \X _^\2 \'^ki\\ukj\+0{l) 

A:=l ' ''^ ^ ' fc:Afce[Ar-c/2,Cr] ' ^ ' 

1 *^ 1 
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Then the first inequahty in (j4.22p (a = 1) can be obtained through a discussion 
similar to that in the proof of (|3.43|) based on Lemma 13.111 When a = 2, we see 

1^1 

fc=i I ''^ ' I 
which imphes the second inequahty in ()4.22p . D 

Now we come to prove ()4.23p . 

Proof of \4^.2'J ). For ease of presentation, we wih work with Q instead of Q^^' . Note 
that 

5.1/2^5.1/2 ^ 5.1/2 1^5.1/2^^*5.1/2 _ ^^^ -1 5.1/2 ^ ^xx* - zT.-^y^ . (4.31) 

For convenience, we wih denote 

$ := T.-\ A := T}/^QY}/^. 
Thus we have 

Afcfc = [{XX* - z^)-\k- 

We will only provide the estimate for QAn in the sequel, the others can be estimated 
analogously. We need the following lemma. 

Lemma 4.10. Let zq := Eq + ir^Q satisfy 

Eo G [A, -c,Xr+ Af-2/3+-] , ryo := iV-2/3+^o- 

with some positive constant Aq > 1. Under Condition {TTJl for any constant C > 
we have 

|Aii(zo)| < Cvo'^^ (4.32) 

with Q-high probability. 

We postpone the proof of Lemma I4.1UI to the end of this section and proceed to 
prove (j4.23p assuming Lemma |4.1U[ By (j4.32p and spectral decomposition we have 

with C-high probability. Now if Aj(W) G [A,. — c, A^ + A^'^/'^+^J, we can set £"0 = Aj, 
thus (|4.33p implies that 

{E^/'^UiU*^^/'^)kk < Cr]l^^ < Ar-i/3+Ao£_ (434) 

Now we use ()4.34p to bound 9Aii(z) for all z satisfying the assumption of Lemma 
By using spectral decomposition again we have 

M 



< ]\f-U^+Aoe y^ 



rj 



(Ai(W)-£)2+??2 
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i:Xi<\r—c 

< ^l/3+2Ao£ 

with ^-high probability. Thus ()4.23p follows if we replace Q by Q^"^' D 

Therefore, we conclude the proof of Lemma 14.81 D 

It remains to prove Lemma |4.10[ We do it as follows. 

Proof of Lemma \4.10 . At first, we recall the notations X* , X^^\ Ti"\ W"^ in the 
proof of (j4.22p . Moreover, we write 

where (pn is the (1, 1) entry of ^ and <I>i is its first column with (/)ii removed. Now 
by using Schur complement to ()4.3ip we can get that 

xix^ - zo^Pu - {x*{xmy - zo<^i) (xW(xW)* - zomy' {xmx, - zo<^i) 
1 

Di + L»2 + -Ds ' 
where 

-1 



D2:=D2{zo) = -zo'cj)* xW(xWr-zo^™ ^1, 



D3:=Dsizo) = zoX*{xm)*(^xm(xmy-zo^^'^) '<i>i 

Note that 

|An| < min{(|9(Di + D2 + Ds)\)-\ mOi + D2 + DsT^}. 

Hence it suffices to show that with ^-high probability, 

\'^{Di + D2 + D3)\ > Cvl^^ (4.35) 

when 

\^{Di + D2 + D3)\ < N^/^ (4.36) 

for some positive constants C. In the sequel, we will prove ()4.35p under the assump- 
tion of (jraj. 

Note that 
Di = X*Xi-zo<pu 
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-1 



For ease of presentation we denote 

Thus by Lemma 13.41 we have 

Di = -zockii - zo^rrGW(zo) + O (^\\G^^\z^)\\hs^ 

with ("-high probabihty. At first, similar to ()3.43p . we can show that with ^-high 
probabihty. 



|^TrGW(^o)|<(logiV)^(^). (4.37) 



Moreover we have 



||GW(^o)lk.<cJ-^^^^^''^^°^ 



which together with (|4.37|) imply that 

^Di{zo)< {log Nf^^'^ (4.38) 

and 



^D,{z,) = -E,l^^Trm{zo) + U^^^ ^^'^^^f °M + 0((logiV)^(So) 

with (^-high probability. 
Now for D2, we have 

^D2{zo) = -{El-r,l)m\[x^^\x^^^r-zo^^^'^y^<^i 

+2Sor?o9^i (^'^k^'^¥ - ^0^^^') ^1, (4.39) 

^D2{zo) = -(£;o'-%')»^l(^f"(^f'lr-^o^™)"'^i 

-2£;o^oK$i (^'^k^'^¥ - ^0^^^') ^1- (4.40) 

And for D3, we denote 

b := zo(xW)* (xW(xW)* - zo$W)^^ $1, 

thus 

D3 = Afi*b + b*Afi*. 
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Then by Lemma 13.41 we have 



lAsI < 



V 



Ibll < 



V 



with (^-high probabihty. Observe that 



$*|xW(XW)*-zo^Wr^i 



$t 



xW(xWr-zo^w 



<51 



$*($W) 



W\-l/2 



;ci>W)-V2j^W(xW)*($W)-V2_^Q 



;($[11)-1 ($W)-1/2^W(XW)*(CI> 



W^-1/2 



zo 



-1 



($W)-l/2$^ 



-2 



< CcDK^W)-^/' ($W)-i/2xW(xW)*(cI>W)-i/2_^g (ci>W)-V2cj,^ 



< Cr?o-i9cI>K<J>W)"'/' ((cI>W)-V2xW(xW)*($ 



)[l]^-l/2 



^0 



(cI.W)-l/2cI>, 



which impHes that 



-1 



$1 



ID.^I < 



if'^C 



\/riQ^'^<^*i (XW(XW)* - zo^my^ <5i 



(4.41) 



with ^-high probabihty. Now note that we have the crude bound 

9$t (XW(XW)* - zo^f^')"^ ^1 < Ct]q\ 
Then by (f08]l . (|09]l and (|OT]l we can see that 

\^{Di +D2 + Ds)\> C|K$*i (xW(xW)* - zo^™) "^ ^i| + ©(iV^/^"^"). 
Then by assumption ()4.36p we have 

\^^l ('xW(xW)* - zo^W)"^ $i| < 0(7V^/6) (4.42) 

Inserting (|4.42|) into (|4.40p we have 

QD2 = -{El- Tily^l (xW(xW)* - zo$W)~^ $1 + 0(iV-i/2+Ce)^ (4 43^ 
Now we claim that 

^rrGW(zo) = "i^(^o) + 0(^), l9rrGW(zo) > C^^i^+T^, (4.44) 

where kq := \Xr — Eq\. Now we wih prove ()4.35p assuming (|4.44p and postpone the 
proof of ()4.44p to the end. With the aid of ()4.44p . we have 



For convenience, we set 



(4.45) 



to = 9^I (xW(xW)*-zo$W) ^$1. 
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Combining (|OT|) . (|0^ and ([O^ we have 

with C-high probability. Now observe that Eo^m^^zo) and (Eq — r/o)*o have the 
same sign, thus it suffices to show that 






(^ ^ 1 /2 0? '^ 1 II , 

to > ^^^0 ' or ^^V < V/^o + % 



holds, which is obvious by the definition of ryo- 

Therefore, it suffices to verify (|4.44|) in the sequel. Now recall W'^^' and G'^1 
defined in the proof of (j4.22p . We start from the observation 



Q\,_, \ ( Q*N 



where 0„ is n x 1 zero vector. Then it is easy to see that 

ranA;(W"-iyW) < 4. 
Moreover, we can write 

ail T.\ 

Si sw 

Then by <1>S = Im-, we can easily get 



xw 



Thus 



Hence we have 



ranA;((xW)*($W)-ixW - W^W) < 1. 



ranA;(VF - (xW)*($W)-ixW) < 5. 

Now we denote the ESD of (XW)*($[i])-iX[i]) by F^. Then we use the rank 
inequality (see Lemma 2.2 of fl] for instance) 

\Fn - i^W| < — ranA:(TF - (xW)*($W)-ixW) < — . 
Then similar to ()3.2ip . we can get ()4.44p . Thus we conclude the proof. D 
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